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@ Recombinant thermostable DMA polymerase from archaebacterla. 

@ Recombinant DNA polymerases from 
archaebacteria as well as isolated DNA coding 
for such polymerases are provided. The isolated 
DNA is obtained by use of DNA or antibody 
probes prepared from the DNA encoding T. 
litoralis DNA polymerase and the T. litoralis DNA 
polymerase respectively. Also provided are 
methods for producing recombinant 
archaebacteria thenmostable DNA polymerase 
and methods for enhancing the expression of 
such polymerases by identifying, locating and 
removing introns from within the DNA coding 
for such DNA polymerases. 
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FIELD OF THE INVENTION 

The present invention relates to recombinant DNA polymerases from archaebacterium, to isolated DNA 
coding for said DNA polymerases which hybridizes to DNA probes prepared from the DNA sequence coding 
5 for T: litomlis DNA polymerase, to DNA and antibody probes employed in the isolation of said DNA, as well as 
to related methods for Isolating said DNA and methods of identifying, locating and removing intervening nu- 
cleotide sequences within said DNA in order to enhance expression of said DNA polymerases 

BACKGROUND OF THE INVENTION 

10 

DNA polymerases are a family of enzymes involved In DNA repair and replication. Extensive research has 
been conducted on the isolation of DNA polymerases from mesophilic microorganisms such as E. colL See, 
for example. Bessman, et al., J. Biol. Chem. (1957) 233:171-177 and Buttin and Kornberg J, Biol. Chem.. (1966) 
241:5419-5427. 

15 Examples of DNA polymerases isolated from E. Coll include E. co// DNA polymerase I. Klenow fragment 

of E. CO// DNA polymerase I and T4 DNA polymerase. These enzymes have a variety of uses in recombinant 
DNA technology including, for example, labelling of DNA by nick translation, second-strand cDNA synthesis In 
cDNA cloning, and DNA sequencing. See Maniatis, et al., Molecular Cloning: A Laboratory Manual (1982). 
Recently, U.S. Patent Nos. 4,683,195, 4,683,202 and 4,800,159 disclosed the use of the above enzymes 

20 in a process for amplifying, detecting, and/or doning nucleic acid sequences. This process, commonly referred 
to as polymerase chain reaction (PGR), involves the use of a polymerase, primers and nucleotide triphosphates 
to amplify existing nucleic acid sequences. 

Some of the DNA polymerases discussed above possess a 3*-5' exonuclease activity which provides a 
proofreading function that gives DNA replication much higher fidelity that It would have If synthetis were the 

25 result of only a one base-pairing selection step. Brutlag, D. and Kornberg, A., J. Biol. Chem., (1 972) 247:241- 
248. DNA polymerases with 3'-5' proofreading exonuclease activity have a substantially lower base incorpor- 
ation error rate when compared with a non-proofreading exonuclease-possessing polymerase. Chang, L.M.S., 
J. Biol. Chem., (1977) 252:1873-1880. 

Research has also been conducted on the isolation and purification of DNA polymerases from thenmo- 

30 philes, such as Thermus aquaticus. Chien, A., etal. J. Bacteriol.. (1976) 127:1550-1557, discloses the isolation 
and purification of a DNA polymerase with a temperature optimum of 80'*C from T. aquaticus YT1 strain. The 
Chien, et al., purification procedure involves a four-step process. These steps involve preparation of crude ex- 
tract, DEAE-Sephadex chromatography, phosphocellulose chromatography, and chromatography on DNA cel- 
lulose. Kaledin, et al., Biokhymiyay (1980) 45:644-651 also discloses the Isolation and purification of a DNA 

35 polymerase from cells of T. aquaticus YT1 strain. The Kaledin. et al. purification procedure involves a six-step 
process. These steps involve isolation of crude extract, ammonium sulfate precipitation, DEAE-cellulose chro- 
matography, fractionation on hydroxyapatite, fractionation on DEAE-cellulose, and chromatography on single- 
strand DNA-cellulose. 

United States Patent No. 4,889,818 discloses a purified thermostable DNA polymerase from 7^ aquaticus, 

40 Taq polymerase, having a molecular weight of about 86,000 to 90,000 daltons prepared by a process substan- 
tially identical to the process of Kaledin with the addition of the substitution of a phosphocellulose chromatog- 
raphy step in lieu of chromatography on single-strand DNA-cellulose. In addition, European Patent Application 
0258017 discloses Taq polymerase as the preferred enzyme for use in the PCR process discussed above. 
Research has indicated that while the Taq DNA polymerase has a 5-3' polymerase-dependent exonu- 

45 ' dease function, the Taq DNA polymerase does not possess a 3'-5' proofreading exonuclease function. Lawyer, 
F.C., et al. J. Biol. Chem., (1989) 264:11, p. 6427-6437. Bernard, A, et aL Ce//(1989) 59:219. As a result, Taq 
DNA polymerase is prone to base incorporation enrors, making its use in certain applications undesirable. For 
example, attempting to clone an amplified gene is problematic since any one copy of the gene may contain an 
error due to a random misincorporation event Depending on where in the replication cycle that error occurs 

50 (e.g., in an early replication cyde), the entire DNA amplified could contain the erroneously incorporated base, 
thus, giving rise to mutated gene product. Furthermore, research has indicated that Taq DNA polymerase has 
a thermal stability of not more than several minutes at 100*^0. 

Accordingly, other DNA polymerases with comparable or improved thenmal stability and/or 3' to 5' exonu- 
dease proofreading activity would be desirable for the scientific community. One such enzyme (described in 

55 more detail below), DNA polymerase from Thermococcus litoralls, an archaebacterium that grows at temper- 
atures dose to lOO'^C near submarine thermal vents, has been doned into E. co//. The production of large 
amounts of this recombinant enzyme protein from this gene is complicated, however, by the presence of two 
introns, one of which must be removed by genetic engineering techniques, and the other which encodes an 
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endonuclease which is spliced out in E. coll. 

It would be desirable to obtain and produce other highly themiostable DNA polymerases from archaebac- 
terium which have a 3' to 5' proofreading activity and/or comparable or improved themnal stability so as to inrv 
prove the DNA polymerase processes described above. 

SUMMARY OF THE INVENTION 



In accordance with the present invention, there is provided methods and products for identifying, isolating 
and cloning DNA which encodes DNA polymerases from archaebacteria. The present invention also relates to 

10 recombinant DNA polymerases from archaebacteria as well as to methods of improving expression of said re- 
combinant DNA polymerases by identifying, locating and removing intervening nucleotide sequences orintrons 
which occur within the DNA coding for said polymerases. 

More specifically, in accordance with the present invention, it has been discovered that DNA coding for 
DNA polymerases from archaebacterium have substantial homology both at the DNA and amino acid level. It 

15 has also been discovered that the DNA from archaebacterium coding for such enzymes appear to have one 
of more intervening nucleotides or introns which also share substantially homology at the DNA level. 

Thus, in accordance with the present invention, DNA probes can be constructed from the DNA sequence 
coding for one DNA polymerase from archaebacterium, such as Thermococcus Irtoralis, and used to identify 
and isolate DNA coding for DNA polymerases from other Archaebacterium such as Pyrococcus. Similariy, an- 

20 tibody probes which are cross-reactive with Z litoralis DNA polymerase can also be used to identify DNA coding 
coding sequences which express such other DNA polymerases. 

Once the DNA coding for the target DNA polymerase has been isolated, it can be used to construct ex- 
pression vectors in order to produce commercial quantitaties of the target DNA polymerase. In this regard, the 
present invention also provides methods of increasing expression levels of the target DNA polymerase by iden- 

25 tifying, locating and removing any intervening nucleotide sequences or introns which occur in the DNA se- 
quence coding for the DNA polymerase. As discussed below, while certain introns are spliced out in E. coll., 
expression of the recombinant DNA polymerase can be enhanced by removal of such intervening nucleotide 
sequences prior to expression in E. coli. 



30 BRIEF DESCRIPTION OF THE DRAWINGS 
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is a photograph of the SDS-polyacrylamide gel of example 1 . 

is a graph showing the polymerase activity and exonuclease activity of the proteins eluted from 
lane 2 of the gel in Fig. 1A. 

35 FIG. 2 - is a restriction site map of the Xba fragment containing the gene encoding the 7: litoralis DNA 
Polymerase which is entirely contained within the BamHI fragment of bacteriophage NEB 619. 
Figures 3A and 3B are graphs showing the half-life of native and recombinant T. litoralis DNA, 
respectively, 

is a graph showing the response of T. litoralis DNA polymerase and Klenow fragment to the pres- 
ence or absence of deoxynucleotides. 

is a restriction site map showing the organization of the 7: litoralis DNA polymerase gene in native 
DNA (BamHI fragment of NEB 619) and in E. coli NEB671 and NEB687. 
is a partial nucleotide sequence of the 14 kb BamHI restriction fragment of bacteriophage 
NEB61 9 inclusive of the 1 .3 kb, 1 .6 kb and 1 .9 kb Eco Rl fragments and part of the Eco RI/BamHI 
45 ' fragment. 

FIG. 7 - is a comparison of the amino acids in the DNA polymerase consensus homology region III with 

the amino acids of the T. litoralis homology island III. 
FIGS. 8-10 are representations of the vectors pPR969 and pCAS4 and V174-1B1 , respectively. 
FIG. 11 - is a graph illustrating the T. litoralis DNA polymerase variant constr\jcted in Example VI lacks de- 
50 tectable 3'to 5' exonuclease activity. 

FIG. 12 - is a nucleotide sequence of the primers used in Example III. 

FIG. 13A- is the ethidium bromide stained agarose gel of Pyrococcus sp. DNA cut with EcoR I (lane 3), 
BamH I (lane 4) and Hind III (lane 5). Lane 1 is XDNA cut with Hind III as markers and lane 2 is 
pBR322 as a mariner. 

55 FIG. 1 SB- is an autoradiography of a Southern hybridization of the same gel in Fig. 1 3A. The 32p-DNA probe 
was prepared from a 1.3 Kb Eco Rl fragment that encodes the amino terminal portion of the 7: 
litoralis DNA polymerase. Note that the BamH I cut Pyrococcus sp. DNA gives a single band of 
about 4-5 Kb with the probe. The fact that the 23 Kd band of Hind III cut XDNA shows up on the 



3 



EP 0 547 920 A2 



film is due to'nonspecific hybridization to the targe amount of DNA present in that band. The fact 
that the plasmid pBR322 lights up is due to homologous sequences in the probe. 

FIG. 14 - is a restriction site map of the 4.8 Kb BamH I fragment containing the gene containing the Pyr- 
ococcus sp. DNA polymerase in the pUC19 plasmid of E coli 2207 (NEB#720). 
5 FIG. 15 - illustrates the probes used to analyze the similarity of DNA for other target archaebacteria. 

FIG. 16 - is an autoradigraph of quadruplicate Southern blots decribed in Example XIV illustrating the hy- 
bridization of probes to T. litoralis and Pyrococcus sp. DNA but not to T. aquaticus DNA. 

FIG. 17 - is a Western blot of crude lysates from T. litoralis (V), Pyrococcus sp, G-l-J (J), Pyrococcus sp. 

G-l-H (H), or purified polymerases from Pyrococcus sp. GB-D (DV), T. aquaticus (T) or 5. coli 
10 (E) reacted with affinity purified anti-vent DNA polymerase antibody in Part A or anti-Taq DNA 

polymerase antibody in Part B. M represents the marker proteins. The arrow indicates the position 
of the 7. litoralis and Pyrococcus sp. DNA polymerase proteins. The reactivity in Part B is to back- 
ground proteins and not to the DNA polymerases as seen in Part A. 

FIG. 1 8 - is a partial DNA nucleotide sequence of the gene coding for the Pyrococcus sp. DNA polymerase. 
15 FIG. 19 - is a comparison of the deduced amino acid sequence of Pyrococcus sp. DNA polymerase to T. 
litoralis DNA polymerase. 



DETAILED DESCRIPTION OF THE INVENTION 



20 In accordance with one prefenred embodiment of the present invention, there is provided a method of pro- 

ducing recombinant DNA polymerase from archaebacterium. The preferred process comprises 1) forming a 
genomic library from the target archaebacterium, 2) transfonming or transfecting an appropriate host cell, 3) 
either i) reacting the DNA from the transformed or transfected host cells with a DNA probe which hybridizes to 
the DNA coding for the DNA polymerase from T. litoralis, or ii) reacting the extract from the transformed or trans- 

25 fected host ceils with an antibody probe which is cross-reactive with T. litoralis DNA polymerase, 4) assaying 
the transfonned or transfected ceils of step 3 which either hybridize to the DNA probe or cross react with the 
T. litoralis specific antibody for thermostable DNA polymerase activity. 

The aforementioned method allows for the production of recombinant DNA polymerases from archaebac- 
terium, as well as for the isolation of DNA coding for said polymerases. 

30 In accordance with another preferred embodiment, there is provided a method for enhancing the expression 

of recombinant DNA polymerases from archaebacterium. As noted above, it is believed that the DNA coding 
for DNA polymerases from archaebacterium may possess one or more introns which may complicate expres- 
sion of the target recombinant DNA polymerase. Location and removal of these introns prior to constructing 
the expression system has been found to enhance expression of the target DNA polymerase, even when the 

35 intron is nonmally spliced out in its host cell. As discussed in more detail below, the intron can be identified and 
removed in a number of ways. In particular, it has also been found that the introns of T. litoralis share substantial 
homology at the DNA level with other genuses of archaebacteria such as Pyrococcus. Knowledge of this fact 
should facilitate the identification, location and removal of introns by the methods described in more detail be- 
low. 

40 In practicing certain embodiments of the present invention it is preferable to employ either i) DNA probes 

which hybridize to the DNA coding for T. litoralis DNA polymerase, or ii) antibodies which cross-react with T. 
litoralis DNA polymerase. DNA probes are preferably constructed based on the DNA sequence coding for the 
7. litoralis DNA polymerase (See Fig. 6), while the antibody probes are preferably made from the purified 7 
litoralis enzyme itself. Following the procedures of the present invention, one could, of course construct probes 

45 ' based on the DNA polymerase or its DNA from other sources of archaebacterium. However, the preferred DNA 
polymerase and DNA used to construct such probes is from 7 litoralis. 



Production of Native 7 litoralis DNA Polymerase 



50 7 litoralis DNA polymerase is obtainable from 7 litoralis strain NS-C (DSM No. 5473. a sample of which 

has also been deposited at the American Type Culture Collection on September 17, 1991 under ATCC Acces- 
sion No. 55233). 7 litoralis was isolated from a submarine thermal vent near Naples, Italy in 1985. This organ- 
ism, 7 litoralis, is an extremely thermophilic, sulfur metabolizing, archaebacteria. with a growth range between 
55'C and gS'^C. Neuner, et al., Arch. Microbiol, (1990) 153:205-207. 

55 For recovering the native protein, 7 litoralis may be grown using any suitable technique, such as the tech- 

nique described by Belkin. et al.. Arch Microbiol. (1985) 142:181-186, the disclosure of which is incorporated 
by reference. Briefly, the cells are grown in the media described above containing 10 mg/ml of sulfur and 0.01 
M cysteine in 1 5 ml screw cap tubes at 95''C for 2 days. When larger amounts of cells are required. 1 liter screw 
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cap bottles are used and after sterilization are inocculated with a fresh 10 ml culture and grown at 90-95?C for 
2 days. 

After cell growth, one preferred method for isolation and purification of the enzyme is accomplished using 
the multi-step process as follows; 

5 First, the cells, if frozen, are thawed, suspended in a suitable buffer such as buffer A (10 mM KPO4 buffer, 

pH 7.4; 1.0 mM EDTA. 1.0 mM beta-mercaptoethanol). sonicated and centrifuged. The supernatant is then 
passed through a column which has a high affinity for proteins that bind to nucleic acids such as Affigel blue 
column (Biorad). The nucleic acids present in supernatant solution of T. litoralis and many of the proteins pass 
through the column and are thereby removed by washing the column with several column volumes of low salt 

10 buffer at pH of about 7.0. After washing, the enzyme is eluted with a linear gradient such as 0.1 to 2.0 M NaCf 
buffer A. The peak DNA polymerase activity is dialyzed and applied to a phosphocellulose column. The column 
is washed and the enzyme activity eluted with a linear gradient such as 0.1 to 1.0 M NaCI in buffer A. The peak 
DNA polymerase activity is dialyzed and applied to a DNA cellulose column. The column is washed and DNA 
polymerase activity is eluted with a linear gradient of 0.1 to 1.0 M NaCI in buffer A. The fractions containing 

15 DNA polymerase activity are pooled, dialyzed against buffer A and applied to a high perfonnance liquid chro- 
matography column (HPLC) mono-Q column (anion exchanger). The enzyme is again eluted with a linear gra- 
dient such as 0.05 to 1 .0 M NaCI in a buffer A. The fractions having thermostable polymerase activity are pooled, 
diluted and applied to HPLC mono-S column (cation exchanger). The enzyme is again eluted with a linear gra- 
dient such as 0.05 to 1 .0 M NaCI in buffer A. The enzyme is about 50% pure at this stage. The enzyme may 

20 be further purified by precipitation of a contaminating lower molecular weight protein by repeated dialysis 
against buffer A supplemented with 50 mM NaCI. 

The apparent molecular weight of the DNA polymerase obtainable from T. litoralis is between about 90,000 
to 95.000 daltons when compared with protein standards of known molecular weight, such as phosphorylase 
B assigned a molecular weight of 97,400 daltons. It should be understood, however, that as a protein from an 

25 extreme thermophile, T. litoralis DNA polymerase may electro phorese at an aberrant relative molecular weight 
due to failure to completely denature or other intrinsic properties. The exact molecular weight of the thenmo- 
stable enzyme of the present invention may be determined from the coding sequence of the T. litoralis DNA 
polymerase gene. The molecular weight of the eluted product may be determined by any technique, for exanrv 
pie, by SDS-polyacrylamide gel electrophoresis (SDS-PAGE) using protein molecular weight mariners. 

30 Polymerase activity is preferably measured by the incorporation of radioactively labeled deoxynucleotides 

into DNAse-treated, or activated, DNA; following subsequent separation of the unincorporated deoxynucleo- 
tides from the DNA substrate, polymerase activity is proportional to the amount of radioactivity in the acid- 
insoluble fraction comprising the DNA. Lehman, I.R., et al., J. BioL Chem. (4958) 233:163, the disclosure of 
which is incorporated herein by reference. 

35 The half-life of the DNA polymerase of the present invention at 100°C is about 60 minutes. The thermal 

stability or half-life of the DNA polymerase is determined by preincubating the enzyme at the temperature of 
interest in the presence of all assay components (buffer, MgCl2, deoxynucleotides, and activated DNA) except 
the single radioactively-labeled deoxynucleotide. At predetemiined time intervals, ranging from 4-1 80 minutes, 
small aliquots are removed, and assayed for polymerase activity using the method described above. 

40 The half-life at lOO^C of the DNA polymerase can also be determined in the presence of stabilizers such 

as the nonionic detergent octoxynol, commonly known as TRITON X-100 (Rohm & Haas Co.), or the protein 
bovine serum albumin (BSA). The non-ionic detergents polyoxyethylated (20) sorbitan monolaurate (Tween 20, 
ICI Americas Inc.) and ethoxylated alkyi Phenol (nonyl) (ICONOL NP-40, BASF Wyandotte Corp.) can also be 
used. Stabilizers are used to prevent the small amount of enzyme added to the reaction mixture from adhering 

45 ' to the sides of the tube or from changing its structural conformation in some manner that decreases its enzy- 
matic activity. The haif-life at 100*'C of the DNA polymerase obtainable from 7: litoralis in the presence of the 
stabilizer TRITON X-100 or BSA is about 95 minutes. 

Preparation Of Recombinant T. litoralis DNA Polymerase 

50 

T litoralis DNA polymerase may also be produced by recombinant DNA techniques, as the gene encoding 
this enzyme has been cloned from T. litoralis genomic DNA. The complete coding sequence for the T. litoralis 
DNA polymerase (Figure 6) can be derived from bacteriophage NEB #619 on an approximately 14 kb BamHI 
restriction fragment This phage was deposited with the American Type Culture Collection (ATCC) on April 24. 
55 1 990 and has Accession No. ATCC 40795. 

The production of a recombinant form of T. litoralis DNA polymerase generally includes the following steps: 
DNA is isolated which encodes the active form of the polymerase, either in its native form or as a fusion with 
other sequences which may or may not be cleaved away from the native fomn of the polymerase and which 
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may or may not effect polymerase activity. Next, the gene is operably linked to appropriate control sequences 
for expression in either prokaryotic or eukaryotic host/vector systems. The vector preferably encodes alt func- 
tions required for transformation and maintenance in a suitable host, and may encode selectable markers 
and/or control sequences for T. litoralis polymerase expression. Active recombinant thermostable polymerase 
5 can be produced by transformed host cultures either continuously or after induction of expression. Active ther- 
mostable polymerase can be recovered either from within host cells or from the culture media if the protein ts 
secreted through the cell membrane. 

While each of the above steps can be accomplished in a number of ways, it has been found that for cloning 
the DNA encoding T. litoralis DNA polymerase, expression of the polymerase from its own control sequences 

10 in E coli results in instability of the polymerase gene, high frequency of mutation in the polymerase gene, slow 
cell growth, and some degree of cell mortality. 

While not wishing to be bound by theory, it is believed that this instability is due at least in part to the pres- 
ence of a 1614 bp intron that splits the 7. litoralis DNA polymerase gene from nucleotides 1776 to 3389 of Fig. 
6, and a second 1170 bp intron that splits the T. litoralis DNA polymerase gene from nucleotides 3534 to 4703. 

15 As discussed below, intervening sequences are also believed to be present in the DNA coding for DNA poly- 
merases from other archaebacteria. Introns from a number of archaebacteria are also believed to share sub- 
stantial homology to the introns present in the DNA for coding for T. litoralis DNA polymerase, which, in accor- 
dance with one aspect of the present invention, will facilitate their identification, location and removal. 

Introns are stretches of intervening DNA which separate coding regions of a gene (the protein coding re- 

20 gions are called exons). Introns can contain nonsense sequences or can code for proteins. In order to make 
a functional protein, the intron must be spliced out of the pre-mRNA before translation of the mature mRNA 
into protein. Introns were originally identified in eukaryotes, but have been recently described in certain pro- 
karyotes. (See, e.g., Krainer and Maniatis (Transcription and Splicing (1988) B.D. Hames and D.M. Glover, eds. 
IRL Press, Oxford and Washington, D.C. pp. 131-206)). When a gene with an intron is transcribed into mRNA 

25 the intron may self- splice out to fonm a mature mRNA or cellular factors may be required to remove the intron 
from the pre-mRNA. Id Bacterial introns often require genus specific co-factors for splicing. For example, a 
Bacillus intron may not be spliced out in E. coli. (Id). 

However, there is some evidence that suggests that the intervening DNA sequence within the gene coding 
for the 7. litoralis DNA polymerase is transcribed and translated, and that the peptide produced therefrom is 

30 spliced out at the protein level, not the mRNA level. Therefore, regardless of where the splicing event occurs, 
in order to express T. litoralis DNA polymerase in E, coll, it is preferable to delete the intervening sequence 
prior to expression of the polymerase in an E. coli system. Of course, the recombinant vector containing the T. 
litoralis DNA polymerase gene could be expressed in systems which possess the appropriate factors for splicing 
the intron, for example, a Thermococcus system. 

35 It is also preferable that the 7 litoralis thermostable polymerase expression be tightly controlled in E. coli 

during cloning and expression. Vectors useful in practicing the present invention should provide varying de- 
grees of controlled expression of T. litoralis polymerase by providing some or all of the following control fea- 
tures : (1) promoters or sites of initiation of transcription, either directly adjacent to the start of the polymerase 
or as fusion proteins, (2) operators which could be used to turn gene expression on or off, (3) ribosome binding 

40 sites for improved translation, and (4) transcription or translation termination sites for improved stability. Ap- 
propriate vectors used in cloning and expression of T. litoralis polymerase include, for example, phage and plas- 
mids. Example of phage include Xgtll (Promega), X Dash (Stratagene) X Zapll (Stratagene). Examples of plas- 
mids include pBR322, pBluescript (Stratagene), pSP73 (Promega), pGW7 (ATCC No. 40166), pET3A (Rosen- 
berg, et al.. Gene, (1987) 56:125-135), and pETIIC (Methods in Enzymology (1990) 185:60-89). 

45 ' 

Transformation and Infection 

Standard protocols exist for transformation, phage infection and cell cultur-e. Maniatis, et al.. Molecular 
Cloning : A Laboratory Manual (1982). Of the numerous B. coli strains which can be used for plasmid trans- 

50 fomaation, the preferred strains include JM101 (ATCC No. 33876), XL1 (Stratagene). and RRI (ATCC No. 
31343), and BL21(DE3) plysS (Method in Enzomology (^990) supra). E, co// strain XL1, ER1578 and ER1458 
(Raleigh, etal., NA. Research (1988) 1 6:1563-1 575) are among the strains that can be used for lambda phage, 
and Y1089 can be used for lambda gtll lysogeny. When preparing transient lysogens in Y1089 (Arasu, et al., 
Experimental Parasitology {^9Q7) 64:281-289), a culture is infected with lambda gtll recombinant phage either 

55 by a single large dose of phage or by co-culturing with a lytic host The infected Y1 089 cells are preferably grown 
at 37''C in the presence of the inducer IPTG resulting in buildup of recombinant protein within the lysis-defective 
host/phage system. 
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Constmction of Genomic DNA Expression Library and Screening for Thermostable Polymerase 

The most common methods of screening for a gene of choice are (1) by hybridization to homologous genes 
from other organisms, (2) selection of activity by complementation of a host defect, (3) reactivity with specific 

5 antibodies, or (4) screening for enzyme activity. For T. litoralis, antibody detection is preferred since it initially 
only requires expression of a portion of the enzyme, not the complete active enzyme. The instability of the T. 
litoralis polymerase gene in E. coli would have made success by other methods more difficult. 

T. litoralis DNA can be used to construct genomic libraries as either random fragments or restriction enzyme 
fragments. The latter approach is preferred. Preferably, Eco Rl partials are prepared from T. litoralis genomic 

10 DNA using standard DNA restriction techniques such as described in Maniatis, etal., Molecular Cloning : A lab- 
oratory Manual (1982), the disclosure of which is incorporated herein by reference. Other restriction enzymes 
such as BamHI, Nrul and Xbal can also be used. 

Although methods are available to screen both plasmids and phage using antibodies (Young and Davis, 
PNAS, (1983) 80:1194-1198), in accordance with the present invention it has been found that phage systems 

15 tend to work better and are therefore preferred for the first libraries. Since it is uncertain whether T. litoralis con- 
trol regions function in E. coli, phage vectors which supply all necessary expression control regions such as 
lambda gt11 and lambda Zap It, are prefenred. By cloning T. litoralis DNA into the Eco Rl site of lambda gtll, T. 
litoralis polymerase may be expressed either as a fusion protein with beat-galactosidase or from its own en- 
dogenous promoter. 

20 Once formed, the expression libraries are screened with mouse anti- 7: litoralis DNA polymerase antiserum 

using standard antibody/plaque procedures such as those described by Young and Davis, PNAS (1983), supra. 
The mouse anti- 7! litoralis DNA polymerase antiserum used to screen the expression libraries can be pre- 
pared using standard techniques, such as the techniques described in Hariow and Cane. Antibodies: A Labo- 
ratory Manual {^9QB) CSH Press, the disclosure of which is incorporated herein by reference. Since most sera 

25 read with E. coli proteins, it is preferable that the T. litoralis polymerase antisera be pre absorbed by standard 
methods against E. coli proteins to reduce background reactivity when screening expression libraries. Phage 
reacting with anti- 7: litoralis polymerase antiserum are picked and plaque purified. Young and Davis. PNAS 
(1983), supra. 

The 7; litoralis DNA polymerase DNA. coding for part or the whole gene, can then be subcloned in, for ex- 
30 ample, pBR322. pBluescript, m13 or pUC19. If desired, the DNA sequence can be determined by, for example, 
the Sanger dideoxy chain-terminating method (Sanger, F., Nicklen, S. & Coulson, A.R. PNAS (1 977) 74:5463- 
5467). 

Identification of DNA Encoding and Expression of the T. litoralis DNA Polymerase 

35 

Several methods exist for determining that the DNA sequence coding for the T. litoralis DNA polymerase 
has been obtained. These include, for example, comparing the actual or deduced amino-tenminal sequence of 
the protein produced by the recombinant DNA to the native protein, or determining whether the recombinant 
DNA produces a protein which binds antibody specific for native T. litoralis DNA polymerase. In addition, re- 

40 search by Wang, et al., FASEB Journal (1989) 3:20 suggests that certain regions of DNA polymerase sequenc- 
es are highly conserved among many species. As a result, by comparing the predicted amino acid sequence 
of the cloned gene product with the amino acid sequence of known DNA polymerases, such as human DNA 
polymerase and £. coli phage T4 DNA polymerase, the identification of these islands of homology provides 
strong evidence that the recombinant DNA indeed encodes a DNA polymerase. Once identified, the DNA se- 

45 ' quence coding for the T. litoralis DNA polymerase, can be cloned into an appropriate expression vector such 
as a plasmid derived from E. coli, for example, pET3A, pBluescript or pUC19, the plasmids derived from the 
Bacillus subtilis such as pUB110. pTP5 and pC194, plasmids derived from yeast such as pSH19 and pSH15, 
bacteriophage such as lambda phage, bacteria such as Agrobacterium tumefaciens, animal viruses such as 
retroviruses and insect viruses such as Baculovirus. 

50 As noted above, in accordance with the present invention, it has been found that DNA coding for T. litoralis 

DNA polymerase contains two introns: i) an 1614 bp intron or inten/ening sequence, spanning from nucleotides 
1776 to 3389 in Figure No. 6, and ii) an 1170 bp intron, spanning nucleotides 3534 to 4703 in Figure 6, This 
1170 bp intron codes for an endonuclease and is found to self-splice out in E. coli. Prior to overex press ion in 
host cells such as E. coli, it is preferable to delete the DNA sequence coding for both the 1614 and 1170 bp 

55 introns. Even though the 1170 bp intron splices out in E. coli., it has been found that expression vectors which 
do not contain this intron result in increased production of the desired polymerase. 

In general, once an intron has been identified and located within a nucleotide sequence, there are a number 
of approaches known in the art which can be used to delete DNA sequences and therefore splice out an intron 
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in-vftro. One method involves identifying unique restriction enzyme sites in the coding region which are near 
the splice junction or area to be deleted. A duplex oligomer is synthesized to bridge the gap between the two 
restriction fragments. A three-part ligation consisting of the amino end restriction fragment, the bridging oligo 
and the carboxy end restriction fragment yields an intact gene with the intron deleted. 

5 Another method is a modification of the above-described method. The majority of the intron is deleted by 

cutting with restriction enzymes with unique sites within the intron, but close to the coding sequence border. 
The linear plasmid containing a deletion of the majority of the intron is ligated together. Single strand phage 
are generated from the pBluescript vector recombinant by super infection with thef1 helper phage IR1. Asingel 
strand oligomer is synthesized with the desired final sequence and is annealed to the partially deleted intron 

10 phage DNA. The remainder of the intron is thus looped out. By producing the original phage in E. coll strain 
CJ236 the Kunkel method of mutagenesis Methods in Enymology 154:367 (1987)) can be used to select for 
the full deleted intron constructs. 

Yet another method which can be used to delete the intron uses DNA amplification. See, for example, Man- 
iatis, et al.. Molecular Cloning: A Laboratory Manual, (1989) Vol. 2, 2nd edition, the disclosure of which is herein 

15 incorporated by reference. Briefly, primers are generated to amplify and subsequently join the amino and car- 
boxyl halves of the gene. 

When an intron is deleted m-vitro, using the methods discussed above, the native splice Junction may be 
unknown. Accordingly, one skilled in the art would predict that several possible artificial splice Junctions exist 
that would result in the production of an active enzyme. 

20 Once the intron is deleted, overexpression of the T. litoralis DNA polymerase can be achieved, for example, 

by separating the T. litoralis DNA polymerase gene from its endogenous control elements and then operably 
linking the polymerase gene to a very tightly controlled promoter such as a T7 expression vector. See, Rosen- 
berg, et al., Gene (1987) 56:125-135, which is hereby incorporated by reference. Insertion of the strong pro- 
moter may be accomplished by Identifying convenient restriction targets near both ends of the T. litoralis DNA 

25 polymerase gene and compatible restriction targets on the vector near the promoter, or generating restriction 
targets using site directed mutagenesis (Kunkel (1984) supra), and transfennng the 7: litoralis DNA polymerase 
gene into the vector in such an orientation as to be under transcriptional and translational control of the strong 
promoter. 

T. litoralis DNA polymerase may also be overexpressed by utilizing a strong ribosome binding site placed 
30 upstream of the T. litoralis DNA polymerase gene to increase expression of the gene. See, Shine and Dal gam o, 
Proc. Natl. Acad. Sci. USA (1974) 71:1342-1348, which is hereby incorporated by reference. 

The recombinant vector is introduced into the appropriate host using standard techniques for transforma- 
tion and phage infection. For example, the calcium chloride method, as described by Cohen, S,N., P/VAS (1972) 
69:2110 is used for E. coU, the disclosure of which is incorporated by reference. The transformation of Bacillus 
35 is carried out according to the method of Chang, S., et al., Molecular and General Genetics (1979) 168:111, 
the disclosure of which is incorporated by reference. Transfonmation of yeast is canried out according to the 
method of Parent, et al., Yeast (1985) 1:83-138, the disclosure of which is incorporated by reference. Certain 
plant cells can be transfonmed with Agrobacterium tumefaciens, according to the method described by Shaw, 
C.H., et a!., Gene (1983) 23:315, the disclosure of which is incorporated by reference. Transformation of animal 
40 cells is earned out according to, for example, the method described in Virology (1973) 52:456, the disclosure 
of which is incorporated by reference. Transfonmation of insect cells with Baculovirus is canned out according 
to, for example, the method described in Biotechnology (1988) 6:47, the disclosure of which is incorporated 
herein by reference. 

The transfonmants are cultivated, depending on the host cell used, using standard techniques appropriate 
45 ' to such cells. For example, for cultivating E. coli, cells are grown in LB media (Maniatis. supra) at 30°C to 42'C 
to mid log or stationary phase. 

The T. litoralis DNA polymerase can be isolated and purified from a culture of transformed host cells, for 
example, by either extraction from cultured cells or the culture solution. 

When the T. litoralis DNA polymerase is to be extracted from a cultured cell, the cells are collected after 
50 cultivation by methods known in the art, for example, centrifugation. Then, the collected cells are suspended 
in an appropriate buffer solution and disrupted by ultrasonic treatment, lysozyme and/or freeze-thawing. A crude 
extract containing the 7. litoralis DNA polymerase is obtained by centrifugation and/or filtration. 

When the T. litoralis DNA polymerase is secreted into the culture solution, i.e., alone or as a fusion protein 
with a secreted protein such as maltose binding protein, the supernatant is separated from the cells by methods 
55 known in the art. 

The separation and purification of the T. litoralis DNA polymerase contained in the culture supernatant or 
the cell extract can be performed by the method described above, or by appropriate combinations of known 
separating and purifying methods. These methods include, for example, methods utilizing solubility such as 
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salt precipitation and solvent precipitation, methods utilizing the difference in molecular weight such as dialysis, 
ultra-filtration, gel-filtration, and SDS-polyacrylamide gel electrophoresis, methods utilizing a difference In elec- 
tric charge such as ion-exchange column chromatography, methods utilizing specific affinity such as affinity 
chromatography, methods utilizing a difference in hydrophobicity such as reverse-phase high performance Hq- 
5 uid chromatography and methods utilizing a difference in isoelectric point such as isoelectric focusing electro- 
phoresis. 

One preferred method for isolating and purification of the recombinant enzyme is accomplished using the 
multi-stage process as follows. 

First, the cells, if frozen are thawed, suspended in a suitable buffer such as Buffer A (100 mM NaCI, 25 
10 mM Tris pH 7.5, 0.1 mM EDTA, 10% glycerol, 0.05% Triton X-100), lysed and centrifuged. The clarified crude 
extract is then heated to 75°C for approximately 30 minutes. The denatured proteins are removed by centrifu- 
gation. The supernatant is then passed through a column that has high affinity for proteins that bind to nucleic 
acids such as Affigel Blue column (Biorad). The nucleic acids present in the supernatant solution and many of 
proteins pass through the column and are thereby removed by washing the column with several column vol- 
ts umes with low-salt buffer at pH of about 7.0. After washing, the enzyme is eluted with a linear gradient such 
as 0.1 M to 1.5 M NaCI Buffer A, The active fractions are pooled, dialyzed and applied to a phosphocellulose 
column. The column is washed and DNA polymerase activity eluted with a linear gradient of 0.1 to 1.0 M NaCI 
in Buffer B (100 M NaCI, 15 mM KPO4, 0.1 mM EDTA, 10% glycerol, 0.05% Triton X-100, pH 6.8). The fractions 
are collected and BSA is added to each fraction. The fractions with DNA polymerase activity are pooled. The 
20 T. litoralis DNA polymerase obtained may be further purified using the standard product purification techniques 
discussed above. 

Stabilization and Use of the 7: litoralis DNA Polymerase 

25 For long-term storage, the thermostable enzyme of the present invention is stored in the following buffer 

0.05 M NaCI, 0.01 M KPO4 (pH 7.4), 0.1 mM EDTA and 50% glycerol at-20°C. 

The T. litoratis DNA polymerase of the present invention may be used for any purpose in which such an 
enzyme is necessary or desirable. For example, in recombinant DNA technology including, second-strand 
cDNA synthesis in cDNA cloning, and DNA sequencing. See Maniatis, et al., supra, 

30 The T. litoralis DNA polymerase of the present invention may be modified chemically or genetically to in- 

activate the 3'-5' exonuclease function and used for any purpose in which such a modified enzyme is desirable, 
e.g., DNA sequencing. 

For example, genetically modified T. litoralis DNA polymerase may be isolated by randomly mutagenizing 
the T. litoralis DNA polymerase gene and then screening for those mutants that have lost exonuclease activity, 
35 without loss of polymerase activity. Alternatively, genetically modified T. litoralis DNA polymerase is preferably 
isolated using the site-directed mutagenesis technique described in Kunkel, T.A., PNAS (1985) 82:488-492, 
the disclosure of which is herein incorporated by reference. 

In addition, the 7^ litoralis DNA polymerase of the present invention may also be used to amplify DNA, e.g., 
by the procedure disclosed in U.S. Patent Nos. 4,683,195, 4,683,202 and 4,800,159. 

40 

Construction of Genomic DNA Library and Screening for Thermostable Polymerase from Archaebacteria 
other than T. litoralis 

In accordance with the present invention, cross hybridization of a target Archae bacterium genomic DNA 
45 ' library using an DNA probe prepared from the DNA polymerase gene of 7: litoralis and/or cross-reactivity with 
mouse anti-7! litoralis antiseaim allows for the identification and isolation of the DNA polymerase genes from 
other archaebacterium, such as Methanococcus, Methanobacter, Methanomicrobtum, Halobacter, Thermo- 
plasma, Thenmococcus, Pyrococcus, and the like (see, e.g. Woese, C, Microbiological Reviews, pp. 221-270, 
June 1 987, the disclosure of which is hereby incorporated by reference). 
50 In general, DNA from other archaebacterium can be isolated using the method described above. As with 

7. litoralis The archaebacterium DNA once isolated can be used to construct genomic libraries as either random 
fragments or restriction enzyme fragments. The latter approach is preferred. This approach generally entails 
cutting the target genomic DNA with various restriction enzymes and probing the firagments so fonned with, for 
example, a T. litoralis DNA probe. A library is thereafter fonned from one or more of the enzymes which produce 
55 a single hybridization band and which are about 4Kb or large enough to at least code for the molecular weight 
of the target DNA polymerase. 

Although methods are available to screen both ptasmids and phage using antibodies or DNA probes (Young 
and Davis, PNAS (1983) 80:1194-1198; Maniatis et al, supra) in accordance with the present invention it has 
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been found the phage systems tend to work better and are therefore prefenred for the first libraries. 

Genomic libraries can be screened using the colony or plaque hybridization procedure (Maniatis, et al. su- 
pra) or using the antibody plaque DNA procedures. In the colony or plaque hybridization procedure, DNA probes 
may be formed by labelling a polymerase gene from a related organism, for example, T. litoralis. The genomic 

5 library is hybridized with labeled probe under conditions which depend on the stringency desired, which may 
be experimentally determined in each case as described below. 

Specifically, although each archaebacterium will require its own set of hybridization conditions, in order to 
maximize the detectability of the target DNA, several basic approaches can be followed. Optimum hybridization 
conditions and probes can be detenmined for each target archaebacterium, for example, by performing test 

10 Southern blots at various temperatures. Hybridization is typically canried out in 4X SET, 0.1 M sodium phos- 
phate, pH 7.0, 0.1% Na pyrophosphate, 0.1% SDS, IX Denhardts solution (Maniatis, supra). Probe selection 
can also vary with respect to size and regions of the 7; litoralis DNA polymerase gene (Fig. 6). Optimum probes 
can be determined for a target archaebacterium by performing test Southern blots as described above with large 
or small DNA fragments, or even oligomers. One could, for example, select probes that are totally within one 

15 of the intervening sequences of T. litoralis to screen for intervening sequences in the target archaebacterium's 
DNA polymerase gene, or such probes could be limited to mature polymerase coding regions. 

In general, the DNA probe could be the entire sequence of Figure 6, or a portion thereof. The DNA probe 
should be at least 20 nucleotides in length, preferably at least about 50 nucleotides in length, most preferably 
at least about 150 nucleotides in length. Three such DNA probes which may be used are the 1.3 kb fragment 

20 (nucleotides 1 to 1274 of Figure 6), the 1.6 kb fragment (nucleotides 1269 to 2856 of Figure 6), and the 1.9 kb 
fragment (nucleotides 2851 to 4771 of Figure 6). 

As with T. litoralis, the DNA coding for the target archaebacterium DNA polymerase may also be obtained 
using an antibody/plaque procedure. When genomic expression libraries are screened using the antibody/pla- 
que procedure, since it is uncertain whether the target archaebacterium's control regions will function in E. coli, 

25 phage vectors which supply all necessary expression control regions such as Xgtll and X2ap II are preferred 
for antibody screening. By cloning archaebacterium DNA into an appropriate site such as the EcoR I site of 
Xgtll, the archaebacterium's DNA polymerase may be expressed either as a fusion protein with beta- 
galactosidase in Xgtll and XZapll or from its own endogenous promoter. 

Once formed, the expression libraries can be screened either with anti-archaebacterium DNA polymerase 

30 antiserum from the target archaebacterium or, by antibody against the DNA polymerase of a closely related 
organism (i.e. T. litoralis, another extreme thermophile) using standard antibody/plaque procedures such as 
those described by Young and David, PNAS (1983), supra. 

Using either procedure, the archaebacterium DNA polymerase DNA, coding for part or the whole gene, 
once identified can then besubcloned in, for example, pBR322, pBluescript. M13orpUC19. If desired, the DNA 

35 sequence can be determined by, for example, the Sanger dideoxy chain-tenminating method (Sanger. F., Nick- 
len, S. & Coulson, A.R. PNAS (1977) 74:5463-5467). 

Identification of the DNA Encoding the DNA Polymerase 

40 Once the genomic DNA expression library has been constructed and the target DNA coding for the arch- 

aebacterium DNA has been identified by use of DNA probes or antibody cross-reactivity from T. litoralis, one 
may confirm that a DNA polymerase sequence has been obtained as described above for T. litoralis. The re- 
sulting clone may be sequenced by standard methods such as by Sanger dedioxy sequencing. 

<5 ' Identification, Location and Removal of Intervening Sequencing and Overexpression of the DNA Polymerase 

In accordance with another aspect of the present invention, it has been found that the DNAcodingforDNA 
polymerases from other archaebacterium also contain one or more intervening nucleotide sequences or introns. 
Moreover, it has been found that not only do such infrons share substantial homology with the introns found in 

so T. litoralis, they appear to be located in the same positions. More specifically, in accordance with the present 
invention, introns have been Identified in the Pol a conserved region motifs in both T. litoralis and Pyrococcus 
sp. DNA polymerase genes. Without wishing to be bound by theory, it is believed that other archaebacteria also 
possess one or more intervening sequences in the coding region for their DNA polymerases. These introns 
can be identified in two ways. If the intron(s) is related to the intron(s) located in T. litoralis and/or Pyrococcus 

55 sp. DNA polymerases genes, they can be identified by low stringency hybridization to DNA probes derived from 
the intron sequences of T. litoralis or Pyrococcus sp. DNA polymerase genes. Secondly, once the archaebac- 
terium DNA polymerase gene has been identified and isolated as described above, its DNA polymerase gene 
can be sequenced at the DNA level and the sequence compared to (1) other DNA polymerases to identify non- 



10 



EP 0 547 920 A2 

similar segments, or (2) conserved motifs to look for the absence of one or more of Regions l-VI, followed by 
identification of interruption points in the Region(s) which are absent 

Once identified, the intron(s) can be removed in vitro by, for example, the techniques described above and 
in the Examples for removal of the two introns in the T. litoralis DNA polymerase gene, 
5 The following examples are given to illustrate embodiments of the present invention as it is presently pre- 

ferred to practice. It will be understood that the examples are illustrative, and that the invention is not to be 
considered as restricted except as indicated in the appended claims. 

EXAMPLE I 

10 

PURIFICATION OF A THERMOSTABLE DNA POLYMERASE FROM THERMOCOCCUS LITORALIS 

7^ litoralis strain NS-C (DSM No. 5473) was grown in the media described by Belkin, et al. supra, containing 
10 g/l of elemental sulfur in a 100 liter fenmentor at its maximal sustainable temperature of approximately 80°C 

15 for two days. The cells were cooled to room temperature, separated from unused sulfur by decanting and col- 
lected by centrifugation and stored at -70°C. The yield of cells was 0.8 g per liter. 

183 g of cells obtained as described above, were suspended in 550 ml buffer A (10 mM KPO4 buffer, pH 
7.4; 1.0 mM EDTA, 1.0 mM beta-mercaptoethanol) containing 0.1 M NaCI and sonicated for 5 minutes at4°C. 
The lysate was centrifuged at 15,000 g for 30 minutes at 4°C. The supernatant solution was passed through 

20 a 470 ml Affigel blue column (Biorad). The column was then washed with 1000 ml of buffer A containing 0.1 M 
NaCI. The column was eluted with a 2000 ml linear gradient from 0.1 to 2.0 M NaCI in buffer A. The DNA poly- 
merase eluted as a single peak at approximately 1 .3 M NaCI and represented 80% of the activity applied. The 
peak activity of DNA polymerase (435 ml) was dialyzed against 4 liters of buffer A, and then applied to 80 ml 
Phosphoceilulose column, equilibrated with buffer A containing 0.1 M NaCI. The column was washed with 160 

25 ml of buffer A containing 0.1 M NaCI, and the enzyme activity was eluted with 1 000 ml linear gradient of 0.1 to 
1.0 M NaCI in buffer A. The activity eluted as a single peak at 0.6 M NaCI and represented 74% of the activity 
applied. The pooled activity (150 ml) was dialyzed against 900 ml of buffer A and applied to a 42 ml DNA-cel- 
lulose column. The column was washed with 84 ml of buffer A containing 0.1 M NaCI, and the enzyme activity 
eluted with a linear gradient of buffer A from 0. 1 to 1 .0 M NaCI. The DNA polymerase activity eluted as a single 

30 peak at 0.3 M NaCI, and represented 80% of the activity applied. The activity was pooled (93 ml). The pooled 
fractions were dialyzed against 2 liters of buffer A containing 0.05 M NaCI and then applied to a 1.0 ml HPLC 
mono-Q column (Phamiacia). The DNA polymerase activity was eluted with a 100 ml linear gradient of 0.05 M 
to 1 .0 M NaCI in buffer A. The DNA polymerase activity eluted as a single peak at 0.1 M NaCI and represented 
16% of the activity applied. The pooled fractions (3.0 ml) were diluted to 6 ml with buffer A and applied to an 

35 1.0 ml HPLC mono-S column (Pharmacia) and eluted with a 100 ml linear gradient in buffer A from 0,05 to 1.0 
M NaCI. The activity eluted as a single peak at 0.19 M NaCI and represented 75% of the activity applied. 

By SDS-poIyacrylamide gel electrophoresis (SDS-PAGE) and subsequent staining of the proteins using a 
colloidal stain (ISS Problue) more sensitive than Coomassie Blue (Neuhoff, et al., Electrophoresis (1 988) 9:255- 
262), it was determined that the DNA polymerase preparation was approximately 50% pure: two major bands 

40 were present, one at 90,000 to 95,000 daltons and a doublet at 18,000 daltons. Figure No. 1 A. A very minor 
band was evident at approximately 80,000 to 85,000 daltons. At this level of purification the polymerase had a 
specific activity of between 30,000 and 50,000 units of polymerase activity per mg of polymerase protein. On 
a separate SDS-polyacrylamide gel verification of the identity of the stained band at 90,000 to 95,000 daltons 
was obtained by cutting the gel lane containing the purified T. litoralis polymerase into 18 slices. Embedded 

45 ' proteins were eluted from the gel by crushing the gel slices in a buffer containing 0.1 % SDS and 100^g/ml BSA. 
The eluted proteins were denatured by exposure toguanidine HCI, then renatured via dilution of the denaturant 
as described by Hager and Burgess Analytical Biochemistry (1980) 109:76-86. Polymerase activity as meas- 
ured by incorporation of radioactivity labeled ^zp-dCTP into acid-insoluble DNA (as previously described) and 
assayed for exonudease activity (as measured by the release of ^H-la belled DNA to an acid soluble fomi as 

50 described in Example V). As shown in Figure No. IB, only the 90,000 to 95,000 daltons band alone showed 
either significant polymerase activity or exonudease activity. 

The DNA polymerase preparation was dialyzed against buffer Acontaining 0.05 M NaCI. As was detenmined 
by SDS-PAGE, much of the 1 8,000 dalton protein precipitated out of the solution. The yield of 7: iiioralis DNA 
polymerase was determined to be 0.5 mg by quantitative protein analysis, and this represented 6.5% of the 

55 total activity present in the starting crude extract 

Purified 7^ litoralis polymerase was electro phoresed and stained with either Coomassie Blue or the colloidal 
stain (ISS Problue) previously described to detect protein. One deeply staining protein band was seen at about 
90,000 to 95,000 daltons; this molecular weight determination was obtained by comparison on the same gel 
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to the migration of the following marker proteins (Bethesda Research Laboratories): myosin, 200,000 daltons; 
phosphoryiase B, 97,400 daltons; BSA, 68,000 daltons; ovalbumin, 43,000 daltons, carbonic anhydrase 29,000 
daltons; b-lactoglobulin, 18,400 daltons; lysoyzme 14,300 daltons. 

5 EXAMPLE II 

CLONING OF T. LITORAUS DNA POLYMERASE GENE 

A. PRODUCTION OF MOUSE ANTI-T: UTORALIS DNA POLYMERASE ANTISERA 

10 

Immunization of Mice 

A 3 ml solution containing 0.4 mg of polymerase protein (obtained by the method of Example I) was con- 
centrated at 4^*0 to approximately 0.3 mi and used to inoculate two mice. The purified T, litoralis polymerase 

15 preparation consisted of four bands of approximately 85-95, 75-85, and a doublet of 1 0-25 kDal on Coomassie 
blue stained SDS-PAGE gels. As shown in Example I, the IT litoralis polymerase is approximately 90-95 kDal. 
Both T, litoralis polymerase antisera recognize all four proteins present in the immunogen. 

The immunization schedule was as follows: mouse one was immunized intra peritioneally (IP) with 20 ^g 
of T. litoralis polymerase, prepared as above, in Freunds' complete adjuvant (FCA). Seven days later, both mice 

20 were immunized IP with 50 ^g T. litoralis polymerase in FCA. Twenty-seven days later both mice were immun- 
ized IP with 30 \xg T. litoralis polymerase for mouse one and 50 ng T. litoralis polymerase for mouse two in 
Freunds' incomplete adjuvant. Mouse one was bled two weeks later and mouse two was bled 20 days later. 
Sera was prepared from blood by standard methods (Harlow and Lane, Antibodies: A Laboratory Mariual, 
1988). 

25 Anti-r. litoralis polymerase antisera was diluted in TBSTT (20 mM Tris pH 7.5, 150 mM NaCI, 0.2% Tween 

20, and 0.05% Triton-X 100) containing 1% BSA, 0.1% NaAzide, 0.1% PMSF. 

Preabsorption of Anti-7: litoralis Polymerase Antiserum Against £ coli Lysates 

30 Since most sera react with E. coli proteins, 7^ litoralis polymerase antisera were preabsorbed, using the 

following method, against E. coli proteins to reduce background reactivity when screening libraries orreconv 
binant antigens. E. coli cell paste was thawed and lysed by sonication and soluble protein was bound to Affigel 
1 0 (Biorad) as described by the manufacturer. 4 ml of E. coli resin were washed two times in TBS (TBSTT with- 
out detergents). 0.35 ml of sera was diluted approximately 1 to 5 in TBSTT, 1% BSA, 0.1% NaAzide and mixed 

35 with resin overnight at 4°C. The resin was pelleted by centrifugation and washed. The recovered preabsorbed 
sera was at a 1 to 17 dilution and was stored frozen at -20°C until use. 

For screening, preabsorbed sera was diluted as above to a final concentration of 1:200. 

B. IDENTIFICATION OF A PROBE FOR THE T. LITORAUS POLYMERASE GENE 

40 

Constaiction of a Lambda gtll Expression Library 

A probe for the 7". litoralis polymerase gene was obtained following immunological screening of a lambda 
gtll expression library. 

45 ' T. litoralis DNA was partially digested as follows: four |ig of T. litoralis DNA was digested at 37'C with five 
units of Eco Rl in a 40 iil reaction using Eco Rl buffer (Eco Rl buffer = 50 mM NaCI, 100 mM Tris pH 7.5, 20 
mM MgCtj, 10 mM BME). Three \i\ of 100 mM EDTA was added to 15 ^l samples at 30. 45 and 60 minutes. 2 
ng of T. litoralis DNA was digested for 90 minutes at 37°C with 20 units of Eco Rl in 20 ^l reaction using Eco 
Rl buffer and the reaction was stopped by adding 2 ^l of 100 mM EDTA. 0.2 ^g of each digest was electro- 

50 phoresed on an agarose gel to monitor the extent of digestion. Approximately 3 ^g of 7! litoralis DNA Eco Rl 
partials (14 ^1 from the 60-minute digest and 19 ^il from the 90-minute digest) were pooled to fomn the "Eco Rl 
poor and heated at 65°C for 15 minutes. 

0.5 ^1 of the Eco Rl pool were ligated to 0.28 ^g of Eco Rl cut, bacterial alkaline phosphatase treated lambda 
gtil DNA in a five nl reaction using standard ligation buffer (ligation buffer = 66 mM Tris pH 7.5, 1 mM ATP, 1 

55 mM spermidine, 10 mM MgCI2, 1 5 mM DTT, and 2 mg/ml gelatin) and 0.5 ^il T4 DNA ligase (New England Bio- 
labs No. 202). The ligation was performed at 16°C ovemighL 4 of this ligation reaction were packaged using 
Gigapack Gold (Stratagene) according to the manufacturers instructions. After incubation at room temperature 
for two hours, the packaged phage were diluted in 500 \i\ of SM (SM = 100 mM NaCI, 8 mM MgS04, 50 mM 
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Tris pH 7.5, 0.01% gelatin) plus three drops chloroform. The packaged Eco Rl library was called sample, V6-1 
and consisted of 1.1 x 10^ individual phage. E. co// strain ER1578 was used for phage infection. 

Immunological Screening of Lambda gtll Expression Library 

5 

The initial phage library was screened (Young, R.A. and R.W. Davis Science, (1983) 222:778-782) with a 
1:200 dilution of the antiserum produced above. 36 phage (VI 0-22 through VI 0-55) which reacted with the anti- 
litoralis DNA polymerase antiserum were picked and 16 phage were plaque purified. 
The 16 antibody positive phage were used to lysogenize E. co//K- 12 strain Y1089. Lysogens were screened 
10 for thermostable DNA polymerase activity, no activity was detected. 

Western blots (Towbin, et al., PNAS, (1979) 76:4350-4354) from these 16 lysates were probed with anti- 
7^ litoralis polymerase antisemm. All proteins from these lysates which reacted with 7! litoralis polymerase an- 
tiserum were smaller than T. litoralis polymerase, and were also smaller than beta-gala ctosidase. indicating 
that none were fusion proteins with beta-galactosidase. 
15 Eight of the 16 antibody positive phage were used to affinity purify epitope-specific antibodies from total 

antiserum (Beall and Mitchell, J. Immunological Methods , (1986) 86:217-223). 

The eight affinity purified sera were used to probe Western blots of both purified T. litoralis polymerase and 
T. litoralis crude lysates. Antibody purified from NEB 618 plaques specifically reacted with T. litoralis polymerase 
in purified and 7. litoralis crude lysates. This was strong evidence that phage NEB 618 encodes approximately 
20 38 kDal of the amino terminus of the T. litoralis polymerase. 

Characterization of Phage NEB 618 and Subcloning of Eco Rl Inserts 

Western blot analysis indicated that phage NEB 618 synthesized several peptides ranging in size from approx- 

25 imately 15-40 kDal which bound T. litoralis polymerase antisera. DNA from phage NEB 618 was purified from 
liquid culture by standard procedures (Maniatis, et al.. supra.) Digestion of NEB 618 DNA with Eco Rl yielded 
fragments of 1.3 and 1.7 kb. An Eco Rl digest of NEB 618 DNA was ligated to Eco Rl cut pBluescript DNA. 20 
]xg of pBluescriptSK+ were digested with 40 units of Eco Rl in 40^l Eco Rl buffer at 37°C for three hours, followed 
by 65'' for 15 minutes. 10 |ig of NEB 618 DNA were digested with 40 units of Eco Rl in 40 ^il Eco Rl buffer at 

30 Z7°0 for 75 minutes, followed by 65''C for 15 minutes. 1 .75 \ig of Eco Rl cut NEB 618 DNA were ligated to 20 
ng Eco Rl cut pBluescriptSK+ with one ^1 T4 DNAIigase (New England Biolabs No. 202) in 10 nl ligation buffer. 
The ligation was performed overnight at le^C. JM101 CaCI competent cells (Maniatis, etal,, supra) were trans- 
fomned with 5 iil of the ligation mixture. Of 24 recombinants examined, all but one contained the 1 .7 kb fragment; 
done V27-5.4 contained the 1.3 kb 7: litoralis DNA fragment. 

35 Antibodies from T, litoralis polymerase mouse antisera were affinity purified, as described above, on lysates 

from V27-5.4 (encoding the 1 .3 kb Eco Rl fragment) and V27-5.7 (encoding the 1 .7 kb Eco Rl fragment in pBlue- 
script) and reacted with Western blot strips containing either purified or crude T. litoralis polymerase. Antibodies 
selected on lysates of V27-5.4 reacted with T. litoralis polymerase in both crude and purified preparations. In 
addition, the first three amino acids from the N-terminal protein sequence of native 7. litoralis polymerase (me- 

40 thionine-isoleucine-leucine) are the same as in the predicted open reading frame (ORF) in the V27-5.4 clone. 
From these results it was concluded that V27-5.4 encoded the amino temninal of T. litoralis polymerase. 
The 1.3 kb Eco Rl fragment of V27- 5.4 comprises nucleotides 1 to 1274 of Figure No. 6. The insert DNA was 
large enough to encode the biggest peptides synthesized by this clone, but not the entire T. litoralis polymerase. 

45 ' C. CONSTRUCTION AND SCREENING OF 7: LITORALIS SECONDARY LIBRARIES 

Antibody screening discussed above, had identified the DNA fragment coding the amino terminal half of 
the 7! litoralis polymerase. In order to find a frag ment large enough to code for the entire gene, restriction digests 
of T: litoralis DNA were probed with the amino temninal half of the polymerase gene contained in clone V27- 

50 5.4. Restriction digests were performed in separate tubes using a master mix which contained 1.2 \ig of T. li- 
toralis DNA in 39 ^1 of restriction enzyme buffer (REB, restriction enzyme buffer = 50 mM NaCI, 10 mM Tris pH 
7.5, 20 mM MgCI2, 10 mM BME). to which 1.5-200 U of enzyme were added as followed: 1.5 U Avril, 9 U Eael, 
10 U Nhel, 20 U NotI, 9 U Spel, 20 U Xhol, 30 U Xbal, 20 U Sad, 10 U BamHI. 20 U Clal, 20 U Hindlll, 20 U 
Pstl. 12 U Nael, 10 U Seal, 12 U XmnI, 20 U EcoRV. 20 U Sal, 20 U Eco Rl, 200 U EagI, 20 U Oral, 5 U Hapl, 

55 8 U Nrul, 4 U SnaBI, 8 U StuI, 10 U Bdl, 8 U Bglll, 10 U Rsal, 10 U Haelll, 8 U Alul, 4 U Hindi. 10 U Pvull. 6 
U Sspl. One |il 10 mg/ml BSA was added to the Hindi digest Ball digest was prepared as above except there 
was 0 mM NaCI in the buffer. All digest were overnight at 37''C except Bdl which was incubated at SO'C. Digests 
were electrophoresed on agarose gels and transferred to NC (Southern, J. MoL Biol. (1975) 98:503-517). The 
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filters were probed with radiolabeled V27-5.4 DNAand hybridization was detected by autoradiography. In, most 
digests, V27-5.4 DNA hybridized to fragments greater than 20 kb, except BamHI (approximately 14 kb), Eco 
Rl (1 .3 kb), Hindlll (approximately 2.4, 5.4 kb), Xbal (approximately 8 kb), Clal (approximately 4.4, 5.5 kb). Ball 
(approximately 8.5 kb), Hindi (approximately 2,1, approximately 2.4 kb), Nrul (approximately 5.5 kb), Bglll (ap- 
5 proximately 2.9 kb), Haelll (approximately 1.3, approximately 1.4 kb) and Rsal which gave numerous small 
bands. 

Digests yielding single fragments large enough to encode the entire polymerase gene, estimated to be 2.4- 
3 kb, based on the size of the native protein, were BamHI, Xbal, and Nrul. 

10 BamHI Library 

A BamHI genomic library was constructed using lambda Dashll. Lambda Dashll is a BamHI substitution 
vector that can be used to clone 1 0-20 kb BamHI DNA fragments. 25-75 nanograms of T. litoralis genomic DNA 
digested with BamHI, as described above, was ligated to 0.5 ng BamHI digested, calf intestine phosphatase 

15 treated lambda Dashll DNA in five |il of standard ligation buffer including 0.5 ^I T4 DNAIigase (New England 
Biolabs No. 202). Three ^1 of the ligation reaction was packaged (Gigapack Plus, Stratagene) as described 
above. Plaque lifts of 8,000 plaques from the lambda Dashll library were probed with labeled gel purified 1,3 
kb Eco Rl fragment from clone V27-5.4 (Maniatis, et al., supra), 2.5% of the phage hybridized to the 1.3 kb 
Eco Rl DNA fragment two of which were plaque purified (clones lambda NEB 819 and lambda V56-9). Both 

20 phage contained a 12-15 kb BamHI fragment which hybridized to the 1.3 kb Eco Rl fragment and contained 
the approximately 8 kb Xbal and approximately 5.5 kb Nrul fragments. The BamHI insert was subcloned into 
pBR322. Colonies containing this fragment grew very pooriy and, based on the polymerase assay described 
above, failed to produce detectable levels of thermostable DNA polymerase. 

25 Xbal Library 

T. litoralis DNA digested with Xbal was cloned into the Xbal site of pUC19. Colony lifts were probed with 
radiolabeled V27-5.4 DNA. No positive clones were detected. 

The Xbal fragment from the BamHI insert in lambda NEB 619 (BamHI library above) was subcloned into 

30 the Xbal site of pUCI 9. Approximately 0.3 ^g of NEB 61 9 DNAdigested with BamHI was ligated to 0.1 ^g pUCI 9 
DNA digested with BamHI using two i^l T4 DNAIigase (New England Biolabs No. 202) in 20 ^! of standard lig- 
ation buffer. The ligation was Incubated overnight at lO'C. CaC!2 competent JM101 and XL-1 cells were trans- 
fonmed with five ^il of ligation mix and incubated overnight at 37°C (Maniatis, et al., supra). Colony lifts were 
probed with radiolabeled purified 1.3 kb Eco Rl fragment from V27-5.4 DNA. No positives were detected. Com- 

35 petent RRI cells were transformed with 10 ^1 of ligation mix and incubated overnight at 30*'C. Micro-colonies 
were picked and mint-plasmid preparations (boiling method, Maniatis, et al., supra) analyzed. Most of these 
clones contained the approximately 8 kb Xbal fragment The rationale for this latter experiment was that since 
the BamHI clones grew pooriy, there would be an increased chance of isolating a plasmid containing the T, 
litoralis polymerase gene from an Xbal colony that also grew slowly. Also, lower temperature of incubation re- 

40 suits in less copies of pUCI 9 plasmids per cell. These results provided evidence that the 7 litoralis polymerase 
gene was toxic to E. coii. Using the polymerase activity assay described above, no thermostable polymerase 
activity was detected in these dones. Restriction analysis indicated that the Xbal clones should contain the 
entire polymerase gene. See Figure No. 2. 

45 ' Nrul Libraries 

Approximately 0.3 ^ig of NEB 619 DNA (BamHI library above) cut with Nrul was ligated to 0.1 ^g of pUC19 
DNA cut with Hindi exactly as described for the Xbal library. Again, no positives were found by hybridization 
when cells were incubated at 37°C, but when transfonmants were incubated at 30'*C, many micro-colonies were 

50 observed. The majority of these micro-colonies contained the approximately 5.5 kb Nrul insert. Using the poly- 
merase activity assay described above, no thenmostable polymerase activity was detected in these colonies. 
Analysis of these colonies determined that when the direction of 7. Litoralis polymerase transcription was the 
same as Iac2 in pUC19, the colonies failed to grow at 37''C and were extremely unstable. 
However, colonies in which the direction of T. litoralis polymerase transcription was opposite of lacZ in pUCI 9, 

55 such as in done Nru21, were more stable. This indicated that transcription of 7: litoralis polymerase is detri- 
mental to E. coli, and may explain why it was so difficult to done the entire gene. Restriction mapping analysis 
indicated that the Nrul dones should contain the entire polymerase gene. See Figure No. 2. 
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Conclusions Concerning Direct Cloning of the Polymerase 

The T. litorah's is approximately 90-95 kDal which would require approximately 2.4-3,0 kb DNAto encode 
the entire gene. Restriction mapping analysis of the 1 .3 kb Eco Rl fragment, coding for the ami no-terminus of 
5 the T: IHoralis polymerase gene, found within the BamHI, Xbal and Nrul clones, discussed above, indicates that 
all three clones contain the entire polymerase gene. All of these larger dones were unstable in E coii. Therefore, 
alternate methods, as discussed below, for cloning the polymerase were tested. 

D. CLONING THE SECOND HALF OF T. UTORAUS POLYMERASE GENE 

10 

It is believed that when the entire T. litorafis polymerase gene was cloned in £ co// while under its endo- 
genous control, mutations in the gene arose. To prevent selection of inactive mutants, the polymerase gene 
was cloned from the 7; litoralis genome in 2 or more pieces which should each separably be inactive and there- 
fore not selected against Restriction mapping of the T. Wora//5 genome was used to determine which restriction 

15 enzymes would produce fragments that would be appropriate for cloning the second half of the T. litoralis poly- 
merase gene. Although the above data indicates that expression of T. litoralis polymerase was toxic for E. co//, 
it was also possible that DNA sequences themselves, in or outside of the coding region, were toxic. Therefore, 
the minimum sized fragment which could encode the entire gene was determined to be the best choice. Re- 
striction analysis indicated that there was an approximately 1.6 kb Eco Rl fragment adjacent to the 3' end of 

20 the amino terminal 1 .3 kb Eco Rl fragment (see Figure No. 2) which could possibly complete the polymerase 
gene. 

Hybridization Probe for the Second Half of the T. litoralis DNA Polymerase Gene 

25 Since none of the previous clones expressed thermostable polymerase activity, it was possible that they 

had accumulated mutations in the coding sequence and would therefore not be suitable sources of the second 
half of the gene. Hybridization probes were therefore required in order to clone the downstream fragments from 
the genome. The approximately 3.2 kb Ndel/Clal fragment from clone Nru21 {the Nru21 clone contains an ap- 
proximately 5.5 kb insert, beginning approximately 300 bp upstream from the start of the polymerase gene) 

30 was subcloned into pSP73 (Promega) creating clone NCII. CaClj competent RRI cells were transformed, as 
above, with the ligation mixture. Mini-plasmid preps of transformants were analyzed by digestion with Ndel and 
Clal and clone NCII containing the T litoralis 3.2 kb Ndel/Clal fragment was identified. This done was stable 
in E, coli The NCII insert was sequenced (Sanger, et al., PNAS, (1 977) 74:5463-5467). The Clal end was iden- 
tical to the V27-5.4 sequence (1.3 kb Eco Rl fragment coding for the amino-terminus of the T. litoralis polymer- 

35 ase). The 1.3 kb Eco Rl junction and beyond was sequenced using primers derived from the 1.3 kb Eco Rl 
fragment sequence. The Ndel end was sequenced from primers within the vector. 

Screening of Eco Rl Genomic Libraries 

40 10 ng of NCII were digested with 30 U of Eco Rl in 100 \i\ of Eco Rl buffer at 37''C for two hours. The 

approximately 1 .6 kb Eco Rl fragment was purified on DE-81 paper (Whabnan) after electrophoresis. The ap- 
proximately 1 .6 kb Eco Rl fragment was radiolabeled and used to probe the original Eco Rl lambda gtil library. 
Infection and plaque lifts were performed as above. Three positives were identified and plaque purified. All con- 
tain the approximately 1.6 kb Eco Rl fragment, but some also contain other inserts. 

45 ' An Eco Rl library was also constructed in lambda Zapll. 2 ^g of 7! litoralis DNA were digested with 20 U 
Eco Rl for five hours at 37*»C in 20^1 Eco Rl buffer and then heat treated at 65°C for 15 minutes. Approximately 
15 nanograms of T. litoralis DNA/Eco Rl was ligated to 0,5 \ig of Eco Rl cut, phosphatased lambda Zapll DNA 
(Stratagene) with 0.5 nl T4DNAIigase (New England Biolabs No. 202) in 5 jil of ligation buffer at 1 6°C overnight 
4 ^1 of ligated DNA was packaged (GigaPack Gold, Stratagene). Infection and plaque lifts were perfonmed as 

50 above. Approximately 1,500 phage were probed with radiolabeled approximately 1.6 kb Eco Rl fragment as 
above. Five hybridization positive plaques were picked and three were plaque purified. Two phage (NEB 620 
and V109-2) were rescued as pBluescript recombinants (V117-1 and V117-2) by in-vivo excision according to 
the manufacturer's instructions (Stratagene). Both contained the approximately 1 .6 kb Eco Rl fragment plus 
different second fragments. The 5' end was sequenced and corresponds to the sequence determined from 

55 NC11 (Clal/Ndel fragment). See Figure No. 2. This Eco Rl fragment contains 3/6 of the T4 DNA polymerase 
family homology islands as described by Wang, et al., supra. The 1 .6 kb Eco Rl fragment comprises nucleotides 
1269 to 2856 of Figure No. 6. 

The sequence of the 1 .6 kb Eco Rl and Clal/Ndel fragments indicated that the 1 .9 kb Eco Rl fragment may 
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be necessary to complete the polymerase gene. Lambda Zapll phage, V110-1 through V110-7. containing the 
1 ,9 kb Eco Rl fragment were identified as described above for NEB 620 using labeled probes. Two phage (V1 1 0- 
2 and V110-4) were rescued as pBIuescript recombinants (V153-2 and V153-4) by in-vivo excision according 
to the manufacturers instructions (Stratagene). Both contained the approximately 1.9 kb Eco Rl fragment plus 

5 different second fragments. The 1.9 kb Eco Rl fragment had sequence identity with the overlapping region in 
NC11. The 1.9 kb Eco Rl fragment comprises nucleotides 2851 to 4771 of Figure No. 6. 

The entire T. litoralis polymerase gene has been cloned as BamHI, Xbal and Nrul fragments which wera 
unstable and from which the active enzyme was not detected. The gene has also been cloned in four pieces 
(1.3 kb Eco Rl fragment, approximately 1.6 kb Eco Rl fragment, approximately 1.9 kb Eco Rl fragment and an 

10 Eco Rl/BamHI fragment containing the stop codon). The 1.3 kb Eco Rl fragment stably expresses the amino 
temninal portion of the polymerase. 

EXAMPLE III 

15 CLONING OF ACTIVE T. LITORALIS DNA POLYMERASE 

The T. litoralis polymerase gene found on the 14 kb BamHI restriction fragment of bacteriophage NEB619 
(ATCC No. 40795), was sequenced using the method of Sanger, et al., PNAS (1977) 74:5463-5467. 5837 bp 
of continuous DNA sequence (SEQ ID N0:1) was determined beginning from the 5' end of the 1.3 kb EcoRI 

20 fragment (position NT 1), see Figure No. 6. 

From analysis of the DNA sequence, it was determined that the polymerase gene begins at NT 291 in the 
1 .3 kb EcoRI fragment. A translation termination site beginning at NT 5397 was also located. Since the apparent 
molecular weight of T. litoralis polymerase was approximately 90-95 Kdai, it was predicted that the gene should 
be -2900 bp. Instead, a 5106 bp open reading frame (ORF) was identified with a coding capacity of 1702 amino 

25 acids (aa) or -185 Kdal. 

By sequence homology with other DNA polymerases, an example of which is set out in Figure No. 7, it was 
discovered that the T. litoralis polymerase gene was intenrupted by an intron or intervening sequence in DNA 
polymerase consensus homology region III (hereinafter "IVSI") (Wang, T., et al., FASEB Journal (1984) 3:14- 
21 the disclosure of which is herein incorporated by reference). The conserved amino acids of the consensus 

30 DNA polymerase homology region III are shown in Figure No. 7. In the Figure, the conserved amino acids are 
underlined. As can be seen in Figure No. 7, the left side of the T. litoralis homology island III (SEQ ID N0:2) 
begins at NT 1737, and homology to the consensus sequence is lost after the Asn and Ser residues. The right 
side of the 7. litoralis homology island III (SEQ ID N0:3) can be picked up at NT 3384, at the Asn and Ser re- 
sidues. When the two T. litoralis polymerase amino acid sequences were positioned so that the Asn and Ser 

35 residues overlap, as in Figure No. 7, it was evident that a good match to the DNA polymerase homology region 
III existed. 

Using the homology data, it was therefore predicted that an intervening sequence existed in the T, litoralis 
DNA separating the left and right halves of the DNA polymerase homology region III. 

In one preferred embodiment, the intervening sequence was deleted by identifying unique restriction en- 
40 zyme sites in the coding region which were near the intervening sequence splice junction. A synthetic duplex 
oligonucleotide was synthesized, and used to bridge the gap between the two restriction fragments. A multi- 
part sequential ligation of the carboxy end restriction fragments, the bridging oligonucleotide, the amino end 
restriction fragment, and the expression vector, resulted in the formation of an expression vector containing an 
intact polymerase gene with the intervening sequence deleted. 
45 ' Specifically, the DNA fragments or sequences used to construct the expression vector of the present in- 
vention containing the 7! litoralis DNA polymerase gene with the intervening sequence deleted were as follows: 
1 . An Ndel site was created by oligonucleotide directed mutagenesis (Kunkel, et a!.. Methods in Enzomol- 
ogy (1987) 154:367:382) in plasmid V27-5,4 (Example II, Part B) such that the initiation codon of the poly- 
merase coding region is contained within the Ndel site. 

50 

Original sequence . . . TTT ATG . . . 

(nucleotides 283-293) 

55 New sequence . . . CAT ATG . . . 

Sequences from the newly created Ndel site to the Clal site (approximately 528 base pairs) were util- 
ized in the construction of the expression vector. 
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2. An approximately 899 bp sequence between the Clal and Pvul site of NC11 (Example II, Part D)., 

3. Asynthetic duplex which spans the intervening sequence, connecting Pvul and Bsu36l sites derived from 
other fragments, as set out in Figure No. 12. 

In Figure No. 12, the first line indicates the original sequence at the 5' end of the splice junction (nu- 
5 cleotides 1721-1784, SEQ ID NO:1), the second line indicates the original sequence of the 3' end of the 

splice junction (nucleotides 3375-3415, SEQ IDNO:1), and the third and fourth lines indicate the sequence 
of the synthetic duplex oligonucleotide. 

4. ABsu361 to BamHI fragment, approximately 2500 base pairs, derived from bacteriophage NEB 619 (Ex- 
ample II. Part C). 

10 5. A BamHI to Ndel fragment of approximately 6200 base pairs representing the vector backbone, derived 

from pETIIc (Studier, Methods in Enzomology, (1990) 185:66-89). and which includes: 

a) The T7 phi 10 promoter and ribosome binding site for the gene 10 protein 

b) Ampicitlin resistance gene 

c) laci<i gene 

15 d) Piasmid origin of replication 

e) Afour-fold repeat of the ribosomal transcription terminators (rrnb), Simons, etal., Gene (1987) 53:85- 
96. 

The above DNA fragments, 1-5, were sequentially ligated under appropriate conditions using T4 DNAIig- 
ase. The correct construct was identified by restriction analysis and named pPR969. See Figure No. 8. pPR969 
20 was used to transfonm E. coli strain RRI, creating a strain designated NEB 687. A sample of NEB 687 was de- 
posited with the American Type Culture Collection on December 7, 1990 and bears ATCC No. 68487. 

In another preferred embodiment, the T. litoralis polymerase gene, with the intervening sequence deleted, 
was cloned into a derivative of the Studier T7 RNA polymerase expression vector pETIIc (Studier, (1990) su- 
pra). The recombinant piasmid V174-1 B1 was used to transform E. co// strain BL21 (DE3)pLysS, creating strain 
25 1 75-1B1, designated NEB671. See Figure Nos. 5 and 10. 

A sample of NEB671 was deposited with the American Type Culture Collection on October 17, 1990 and 
bears ATCC No. 68447. 

A comparison between the predicted and observed molecular weights of the polymerase, even with the 
IVS1 deleted, revealed a discrepancy. The predicted molecular weight of the polymerase after removal of IVS1 

30 in region III is 1 32 Kb, while the observed molecular weight of either the native (see Example I) or recombinant 
(see Example IV) polymerase is about 95 kD. The molecular weight discrepancy is due to an intron (hereinafter 
"1732") in homology region I. This finding is based on the following observations: The distance between hom- 
ology regions III and I varies from 15-135 amino acids in members of the pol alpha family (Wang, (1989) supra). 
In T. litoralis there are 407 amino acids or ~-44-kD separating these regions. T. litoralis DNA polymerase is very 

35 similar to human pol alpha except for 360 amino acids between conserved homology regions I and Ml where 
no similarity exists. Finally, no consensus region I is observed. 

In addition, as determined by SDS-PAGE, a thermostable endonuclease of approximately 42-47 kD is also 
produced by the T. litoralis DNA polymerase clones of the present invention (see Example X). This endonu- 
clease was purified to homogeneity by standard ion exchange chromatography, and was sequenced at its ami- 

40 no-terminal. The first 30 amino acids of the endonuclease con-espond to the amino acids encoded beginning 
at nucleotide 3534 of the polymerase clone (SEQ ID N0:1). This corresponds to the portion of the polymerase 
which lacks homology with other known polymerases This endonuclease does not react with anti-lT litoralis 
DNA polymerase antisera. While the exact mechanism by which the endonuclease is spliced out of the poly- 
merase is unknown, it occurs spontaneously in both E. coli and 7. litoralis. 

45 ' 

EXAMPLE IV 

PURIFICATION OF RECOMBINANT T. LITORALIS DNA POLYMERASE 

50 E. coli NEB671 (ATCC No. 68447) was grown in a 100 liter fermentor in media containing 10 g/liter tryptone, 
5 g/liter yeast extract, 5 g/liter NaCI and 100 mg/liter ampicillin at 35''C and induced with 0.3 mM IPTG at mid- 
exponential growth phase and incubated an additional 4 hours. The cells were harvested by centrifugatron and 
stored at -70**C. 

580 grams of cells were thawed and suspended in Buffer A (100 mM NaCI, 25 mM KPO4 at pH 7.0, 0.1 
55 mM EDTA, 0.05% Triton X-1 00 and 10% glycerol) to a total volume of 2400 ml. The cells were lysed by passage 
through a Gaulin homogenizer. The crude extract was clarified by centrifugation. The clarified crude extract 
volume was adjusted to 2200 mis with the above buffer and was heated to 75°C for 30 minutes. The particulate 
material was removed by centrifugation and the remaining supernatant contained about 3120 mg of soluble 
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protein. 

The supernatant was applied to a DEAE-sepharose column (5 x 13 cm; 255 ml bed volume) linked in series 
to a phosphocellulose column (5 x 11 cm; 216 ml bed volume). The DEAE-sepharose flow-through fraction, 
containing the bulk of the enzyme, passed immediately onto the phosphocellulose column. Both columns were 
5 washed with 300 mis Buffer A, the two columns were disconnected, and the protein on the phosphocellulose 
column was eluted with a 2 liter linear gradient of NaCI from 0.1 M to 1 M formed in Buffer A. 

The column fractions were assayed for DNA polymerase activity. Briefly, 1-4 nJ effractions were incubated 
for 5-10 minutes at 75°C in 50 ^1 of 1X 7: litoralis DNA polymerase buffer (10 mM KCI, 20 mM Tris-HCI (ph 8.8 
at24°C), 10 mM (NH4)2S04, 2 mM MgS04 and 0.1% Triton X-100) containing 30 ^iM each dNTP and ^H-labeled 
10 TIP, 0.2 mg/ml activated calf thymus DNA and 100 ng/ml acetylated BSA, although it has been found that non- 
acetylated BSA is prefenred. The mixtures were applied to Whatman 3 mm filters and the filters were subjected 
to three washes of 10% TCA followed by two washes of cold isopropanol. After drying of the filters, bound ra- 
dioactivity representing incorporation of ^H-TTP into the DNA was measured. The active fractions were pooled 
and the enzyme activity levels in each pool were assessed using the above assay conditions except the dNTP 
15 level was raised to 200 \iM each dNTP. Under these conditions one unit of enzyme activity was defined as the 
amount of enzyme that will incorporate 10 nmoles of dNTP into acid-insoluble material at 75^*0 in 30 minutes. 

The active fractions comprising a 300 ml volume containing 66 mg protein, were applied to a hydroxylapatite 
column (2.5 x 5 cm; 25 ml bed volume) equilibrated with Buffer B (400 mM NaCI, 10 mM KPO4 at pH 7.0, 0.1 
mM EDTA, 0.05% Triton X-100 and 10% glycerol). The protein was eluted with a 250 ml linear gradient of KPO4 
20 from 10 mM to 500 mM formed in Buffer B. The active fractions, comprising a 59 ml volume containing 27 mg 
protein, was pooled and dialyzed against Buffer C (200 mM NaCI, 10 mM Tris-HCI at pH 7.5, 0.1 mM EDTA, 
0.05% Triton X-100 and 10% glycerol). 

The dialysate was applied to a heparin-sepharose column (1.4 x 4 cm; 6 ml bed volume) and washed with 
20 ml Buffer C. A 100 ml linear gradient of NaCI from 200 mM to 700 mM fonmed in Buffer C was applied to the 
25 column. The active fractions, comprising a 40 ml volume containing 16 mg protein was pooled and dialyzed 
against Buffer C. 

The dialysate was applied to an Affi-gel Blue chromatography column (1 .4 x 4 cm; 6 ml bed volume), wash- 
ed with 20 ml Buffer C, and the protein was eluted with a 95 ml linear gradient from 0.2 M to 2 M NaCI formed 
in Buffer C. The active fractions, comprising a 30 ml volume containing 11 mg of protein, was dialyzed against 
30 a storage buffer containing 200 mM KCI, 10 mM Tris-HCI (pH 7.4), 1 mM DTT, 0.1 mM EDTA, 0.1% Triton X- 
100, 100 ^ig/ml BSA and 50% glycerol. 

The T. litoralis DNA polymerase obtained above had a specific activity of 20,000-40,000 units/mg. 

Characterization of recombinant T. litoralis polymerase 

35 

Recombinant and native T litoralis polymerase had the same apparent molecular weight when electrophor- 
esed in 5- 10% SDS-PAGE gradient gels. Recombinant 7. litoralis polymerase maintains the heat stability of 
the native enzyme. Recombinant T. litoralis polymerase has the same 3' — >5' exonudease activity as native 
T. litoralis polymerase, which is also sensitive to inhibition by dNTPs. 

40 

EXAMPLE V 

OVER-EXPRESSION OF THE THERMOCOCCUS LITORALIS DNA POLYMERASE GENE 

45 ' The 7: litoralis DNA polymerase gene, with IVS1 deleted, e.g., VI 74-1 B1 obtained in Example III, may be used 
in a number of approaches, or combinations thereof, to obtain maximum expression of the cloned T. litoralis 
DNA polymerase. 

One such approach comprises separating the T. litoralis DNA polymerase gerte from its endogenous control 
elements and then operably linking the polymerase gene to a very tightly controlled promoter such as a T7 ex- 

50 pression vector (Rosenberg, et al., Gene (1987) 56:125-135). Insertion of the strong promoter may be acconv 
plished by identifiying convenient restriction targets near both ends of the T. litoralis DNA polymerase gene and 
compatible restriction targets on the vector near the promoter, or generating restriction targets using site di- 
rected mutagenesis (Kunkel, (1984), supra), and transferring the T, litoralis DNA polymerase gene into the vec- 
tor in such an orientation as to be under transcriptional and translational control of the strong promoter. 

55 T. litoralis DNA polymerase may also be overexpressed by utilizing a strong ribosome binding site placed 

upstream of the T. litoralis DNA polymerase gene to increase expression of the gene. See, Shine and Dalgamo, 
Proc. Natl. Acad, Sci, USA (1974) 71:1342-1346. which is hereby incorporated by reference. 

Another approach for increasing expression of the 7! litoralis DNA polymerase gene comprises altering the 
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DNA sequence of the gene by site directed mutagenesis or resynthesis to contain initiation codons that are 
more efficiently utilized than £. colL 

Finally, T. litoralis DNA polymerase may be more stable in eukaryote systems like yeast and Baculovirus. 

The T. IHorah's DNA polymerase may be produced from clones canrying the T. litoralis DNA polymerase gene 
by propagation in a fermentor in a rich medium containing appropriate antibiotics. Cells are thereafter harvested 
by centrifugation and disrupted by sonication to produce a crude cell extract containing the T, litoralis DNA poly- 
merase activity. 

The crude extract containing the T. litoralis DNA polymerase activity is purified by the method described 
in Example I, or by standard product purification techniques such as affinlty-chromatography, or ion-exchange 
chromatography. 

EXAMPLE VI 

PRODUCTION OF A T. LITORALIS DNA POLYMERASE 3' TO 5' EXONUCLEASE MUTANT 

T litoralis DNA polymerase lacking 3' to 5* exonuclease activity was constructed using site-directed mu- 
tagenesis to alter the codons for asp141 and gtu143 to code for alanine. Site-directed mutagenesis has been 
used to create DNA polymerase variants which are reported to have reduced exonuclease activity, including 
phi29 {Cell (1989) 59:219-228) DNA polymerase I {Science (1988) 240:199-201) and T7 DNA polymerases 
(U.S. Patent No. 4,942,130). 

Site-directed mutagenesis of the polymerase of the present invention was accomplished using a modifh 
cation of the technique described by Kunkel, T.A., PNAS (1985) 82:488-492, the disclosure of which is herein 
incorporated by reference. The V27-5.4 plasmid (see Example 2, Part B) was used to constmct the site-directed 
mutants. V27-5.4 encodes the 1.3 kb EcoRI fragment in pBluescript Sk+. E. coli strain CJ236 (Kunkel, et al., 
Metfiods in Enymology{^9B7) 154:367-382), a strain that incorporates deoxyuracil in place of deoxythymidine, 
containing the V27-5.4 plasmid was superinfected with the f1 helper phage IR1 {Virology, (1982) 122:222-228) 
to produce single stranded versions of the plasmid. 

Briefly, the side-directed mutants were constructed using the following approach. First, a mutant oligonu- 
cleotide primer, 35 bases in length, was synthesized using standard procedures. The oligonucleotide was hy- 
bridized to the single-stranded template. After hybridization the oligonucleotide was extended using T4 DNA 
polymerase. The resulting double-stranded DNA was converted to a closed circular dsDNA by treatment with 
T4 DNA ligase. Plasmids containing the sought after mutations were identified by virtue of the creation of a 
Pvul site overlapping the changed bases, as set out below. One such plasmid was identified and named pAJG2. 

The original and revised sequences for amino acid residues are 141, 142, and 143: 

. . asp lie glu 
Original : . . GAT ATT GAA 

. . al a i 1 e al a 
Altered: . . G CG ATC G CA 

The newly created Pvul site, used to screen for the alteration, is underiined. Note that the middle codon was 
changed but that the amino acid encoded by this new codon is the same as the previous one. 

An approximately 120 bp Clal to Ncol fragmentfrom V174-1B1 (see Example III) was replaced by the cor- 
responding fragment bearing the above substitutions from pAJG2, creating pCAS4 (see Figure No. 9). pCAS4 
thus differs from VI 74-1 B1 by 4 base pairs, namely those described above. 

E. coli BL21 (DE3) plysS {Methods in Enzomology, (1990) 185:60-89) was transformed with pCAS4, cre- 
ating strain NEB681. Expression of the mutant T, litoralis polymerase was induced by addition of IPTG. 

Asampleof NEB681 has been deposited with the American Type Culture Collection on Novembers, 1990, 
and bears ATCC No. 68473. 

Relative exonuclease activities in the native 7. litoralis DNA polymerase and the exonuclease minus variant 
isolated from E. coii NEB681 was detennnined using a uniformly PH] labeled E. coli DNA substrate. Wild type 
T. litoralis DNA polymerase was from a highly purified lot currently sold by New England Biolabs, Inc. The ex- 
onuclease minus variant was partially purified through DEAE sepharose and phosphocellulose columns to re- 
move contaminants which interfered with the exonuclease assays. The indicated number of units of POLYMER- 
ASE were added to a 0.1 ml reaction containing 7. litoralis DNA polymerase buffer [20 mM Tris-Hcl (pH8.8 at 
25*»C), 10 mM KCI, 10 mM (NH4)2S04, 5 mM MgS04. 0.1% Triton X-IOO], 0.1 mg/ml bovine serum albumin, 
and 3 ng/ml DNA substrate (specific activity 200,000 cpm/ng) and the reaction was overiaid with mineral oil to 
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prevent evaporation of the reaction. Identical reactions contained in addition 20 \ihA dNTP, previously shown 
to inhibit the exonuclease activity of the wild type enzyme. The connplete reaction mixture was incubated at 
70°C for 60 minutes, following which 0.08 ml was removed and mixed with 0.02 ml 0,5 mg/ml sonicated herring 
spenm DNA (to aid in precipitation of intact DNA) and 0.2 ml of 10% trichloroacetic acid at 4°C. After mixing. 

5 the reaction was incubated on ice for 5 minutes, and the DNA then pelleted at4*'Cfor5 minutes in an Eppendorf 
centrifuge. 0.25 ml of supernatant was mixed with scintillation fluid and counted. The results of the sample 
counting, corrected for background, are shown in Figure No. 11. 

As illustrated in Figure No. 11, the exonuclease minus variant was substantially free of exonuclease activity 
in the presence or absence of dNTPs under conditions where the native polymerase cleariy demonstrated ex- 

10 onuclease activity. Conservatively estimating that a level of activity two-fold above background could have been 
detected, this implies that the exonuclease activity is decreased at least 60-fold in this variant 

EXAMPLE VII 

15 T. LITORAUS DNAPOLYI\/1ERASE HALF-LIFE DETERMINATION 

The thenmostability or half-life of the T. litoralis DNA polymerase purified as described above in Example 
1 was determined by the following method. Purified T. litoralis DNA polymerase (25 units) was preincu bated at 
1 00°C in the following buffer: 70 mM tris-HCl (pH 8.8 at 25''C), 1 7 mM ammonium sulfate. 7 mM MgCIj, 1 0 mM 

20 beta-mercaptoethanol, 200nM each deoxynucleotide and 200 jig/ml DNAse-treated DNA. An initial sample was 
taken at time zero and a small aliquot equivalent to 5% of the enzyme mixture was removed at 10, 20, 40, 60, 
90, 120, 150, and 180 minutes. The polymerase activity was measured by determining incorporation of deox- 
ynucleotide into DNA as described previously. 

A sample of Taq DNA polymerase obtained from New England Biolabs was subjected to the above assay. 

25 An initial sample was taken at time zero and a small aliquot equivalent to 5% of the enzyme mixture was re- 
moved at 4, 7, and 10 minutes. As shown in the Figure No. 3. the half-life of the T. fitoralis DNA polymerase at 
100°C was 60 minutes, while the half-life of the Taq polymerase at lOO^C was 4.5 minutes. 

As shown in Figure No. 3A, the half-life of T. litoralis DNA polymerase at 100**C in the absence of stabilizers 
was 60 minutes, while in the presence of the stabilizers TRITON X-100 (0.15%) or BSA (lOOng/ml) the half- 

30 life was 95 minutes. This was in stark contrast to the half-life of Taq DNA polymerases at 100°C, which in the 
presence or absence of stabilizers was 4.5 minutes. 

The themnostability or half-life of recombinant T. litoralis DNA polymerase purified as describe above in Ex- 
ample IV was found to have a biphasic heat inactivation curve at temperatures greater than about 90°C. These 
two phases were characterized by half-lives of about 5 minutes and 7 hours (Fig. SB). To provide more con- 

35 sistent behavior at extreme temperatures, an additional purification step may be used to eliminate the more 
heat sensitive component of the polymerase. 

Specifically, the final enzyme preparation of Example IV was heated at lOO^C for 15 minutes then cooled 
on ice for 30 minutes. Precipitated proteins were removed by centifugation at 12,000 xg for 10 minutes at 4°C. 
Approxiniately 20% of the initial polymerase activity was lost in this procedure. The remaining DNA polymerase 

40 showed a monophasic heat inactivation profile, with a half-life at QS'^C of about 7 hours. The resulting polymer- 
ase also showed kinetic characteristics at75°C which were similar to the native enzyme and to the recombinant 
enzyme prepared in accordance with Example IV. 

EXAMPLE VIII 

45 ' 

DETERMINATION OF 3'-5' PROOFREADING ACTIVITY 

1. Response of T. litoralis DNA Polymerase to the Absence or Presence of Deoxy nucleotides 

50 The levels of exonuclease activities associated with polymerases show very different responses to deox- 

ynucleotides. Nonproofreading 5'-3' exonucieases are stimulated tenfold or greater by concomitant polymeri- 
zation afforded by the presence of deoxynucleotides, while proofreading 3'-5' exonucieases are inhibited com- 
pletely by concomitant polymerization. Lehman, I.R. ARB (1967) 36:645. 

The T. litoralis DNA polymerase or polymerases with well-characterized exonuclease functions (T4 Poly- 

55 merase, Klenow fragment) were incubated with 1 ng ^H-thumidine-labeled double-stranded DNA (1 0^ CPM/^ig) 
in polymerization buffer (70 mM tris (pH 8.8 at 24*0), 2 mM MgClj, 0.1% Triton and 100 ng/ml bovine serum 
albumin). After an incubation period of three hours (experiment 1) or four hours (experiment 2) at either 70"'C 
(themiophilic polymerases) or 37*'C (mesophilic polymerases), the exonudease-hydrolyzed bases were quan- 
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tified by measuring the acid-sotuble radtoactively-labeled bases. 

As shown in Table 1 , the Taq DNA polymerase, with its 5-3' exonuclease activity, shows stimulation of ex- 
onuclease activity when deoxynucleotides were present at 30 uM. However, polymerases with 3'-5' proofread- 
ing exonuclease activities, such as theT4 polymerase, Klenowfragmentof E co// polymerase I, or the T. litoralis 
DNA polymerase showed the reverse, an inhibitory response to the presence of deoxynucleotides. 

The similarity of responses to the presence or absence of deoxynucleotides of the T. litoralis DNA poly- 
merase and the well-characterized Klenow fragment of the E. coli DNA polymerase is further shown in Figure 
No. 4. Twenty units of 
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either polymerase was incubated with 9 3H-thymidine-labeled double-stranded DNA (IQS CPM/|ag) in 350 
^l polymerization buffer as described above in the presence, or absence of, 30 ^iM deoxynudeotides. At each 
time point, 50 was removed and the level of acid-soluble radioactively-labeled bases were measured. As 
Figure No. 4 documents, the behavior of Z litoralis DNA polymerase and the Klenow fragment of E. co// DNA 
polymerase, which contains a well-characterized 3'-5' proofreading exonudease activity, are very similar. 

2. Response of T. litoralis DNA Polymerase to Increasing Deoxy nucleotide Concentrations 

Exonudease activities of polymerases are affected by the level of deoxynudeotides present during poly- 
merization, in as much as these levels affect polymerization. As deoxynucleotide levels are increased towards 
the Km (Michaelis constant) of the enzyme, the rate of polymerization is increased. For exonudease functions 
of polymerases sensitive to the rate of polymerization, changes in exonudease activity are parallel with increas- 
es in deoxynucleotide concentrations. The increase in polymerization rate drastically decreases proofreading 
3'-5' exonudease activity with a concomitant increase in polymerization-dependent 5'-3' exonudease activity. 

The exonudease function of the T. litoralis DNA polymerase was compared to those of well-characterized 
exonudease functions of other polymerases as the deoxynucleotide concentration was increased from 10 uM 
to 100 uM. The exonudease activity was measured as described in (1) with an incubation period of 30 minutes. 
As summarized in Table 2, the 7^ litoralis DNA polymerase responded to increases in deoxynudeotide levels 
similariy to a polymerase known to possess a 3*-5' proofreading exonudease (Klenow fragment of E. co// DNA 
Pol. I). This 
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response was in contradiction to that of a polymerase known not to possess this proofreading function, Taq 
DNA polynnerase. This polymerase responded to an increase in deoxynucleotide levels with an increase in ex- 
onuclease function due to its 5'-3' exonudease activity. 
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3. Response of T. litoralis DNA Poiymerase to Alteration from a Balanced Deoxynucleotide State to an Un- 
balanced State 

Polymerization is dependent on equal levels of all four deoxynucleotides present during DNA synthesis. If 
the deoxynucleotide levels are not equal, polymerases have decreased polymerization rates and are more likely 
to insert incorrect bases. Such conditions greatly increase proofreading 3*-5' exo nuclease activities while de- 
creasing 5-3' exonuclease activities. Lehman, I.R., ARB (1967) 36:645). 

The T. litoralis DNA polymerase was incubated with both balanced deoxynucleotide levels (30 uM) and two 
levels of imbalance characterized by dCTP present at 1/10 or 1/100 the level of the other three deoxynucleo- 
tides. The response of the T. litoralis DNA polymerase was then compared to that of three polymerases pos- 
sessing either the 3'-5' or the 5-3' exonuclease functions. All assays were performed as described in (1) except 
the dCTP concentrations listed below. As seen in Table 3 below, the T. litoralis DNA polymerase follows the 
expected behavior for a proofreading 3'-5* exonudease-containing polymerase; an imbalance in deoxynucleo- 
tide pools increased the exonuclease activity in a similar manner as that of the proofreading polymerases of 
T4 DNA polymerase or Klenow fragment of E. coli DNA polymerase I. In contrast to this response, the exonu- 
clease of the Taq DNA polymerase was not affected until the imbalance was heightened to the point that poly- 
merization was inhibited. 
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4. Directionality of Exonuclease Activity 

A proofreading exonuciease has a 3'-5' directionality on DNA while nonproofreading exonuclease associ- 
ated with DNA polymerases have a 5'-3' directionality. To discern the direction of the exonuclease activity of 
5 T. litoralis DMA polymerase, the 5' blocked DNA of adenovirus was utilized. Since the 5' end of this DNA is 
blocked by protein, enzymic activities that are 5'-3' in directionality cannot digest this double-stranded DNA; 
however, enzymic activities that are 3'-5\ such as exonuclease III or proofreading exonud ease-containing poly- 
merases, can digest adenovirus DNA. 

Twenty-five units of exonuclease III or 20 units of either T. litoralis DNA polymerase, T4 DNA polymerase 
10 (possessing a well characterized 3'-5' exonuclease activity), or Taq DNA polymerase (lacking such an activity) 
were incubated with S^ig adenovirus DNA for time periods up to 30 minutes duration at either 37"'C (T4 poly- 
merase and exonuclease III) or 70*'C (Taq polymerase and T. litoraiis polymerase) in the presence of 70 mM 
tris-HCI pH 8.8 at 25**C, 2 mM MgCtj and 100 ng/ml BSA. At the end of each incubation Ume period, enzymic 
activity was stopped by phenol extraction of the adenovirus DNA, followed by Hpal digestion for one hour at 
15 37^C in 20 mM tris, pH 7.9 at 25*'C, 10 mM Magnesium acetate 50 mM potassium acetate and 1 mM DTT. The 
DNA fragments were subjected to agarose gel electrophoresis and the resulting pattern of time-dependent deg- 
radation and subsequent loss of double-stranded DNA fragments were assessed. 

The 3'-5* exonuclease activities of exonuclease III, of T. iitoralis DNA polymerase and T4 DNA polymerase 
caused the disappearance of the double-strand DNA fragments originating from the 5' blocked end of the ade- 
20 novirus DNA, indicating vulnerability of its 3' end. In contrast, the Taq DNA polymerase with its 5*-3' polymer- 
ization-dependent exonuclease activity, showed no disappearance of the DNA fragment. 

EXAMPLE IX 

25 PERFORMANCE OF T litoralis DNA POLYMERASE IN THE PGR PROCESS 

The ability of the T. litoralis DNA polymerase to perform the polymerase chain reaction (PCR) was also 
examined. In 100 nl volumes containing the buffer described in Example IV, varying amounts of M13mp18 DNA 
cut by Clal digestion, generating 2 fragments of 4355 bp and 2895 bp, were incubated with 200 ng of calf thymus 

30 DNA present as carrier DNA to decrease any nonspecific adsorption effects. The forward and reverse primers 
were present at 1 ^M (fon^^ard primer = 5'd(CCAGCAAGGCCGATAGTTTGAGTT)3' and the reverse primer = 
5' d(CGCCAGGGTTTTCCCAGTCACGAC)3'). These primers flank a 1 kb DNA sequence on the 4355 bp frag- 
ment described above, with the sequence representing 14% of the total M13mp18 DNA. Also present were 200 
HM each dNTP, 100 jig/ml BSA, 10% DMSO and 2.5 units of either 7: aquaiicus DNA polymerase (in the pres- 

35 ence or absence of 0.5% NP40 and 0.05% Tween 20), or T. litoralis DNA polymerase (in the presence or ab- 
sence of 0.10% Triton X-100). The initial cycle consisted of 5 min at 95°C, 5 min at 50"'C (during which poly- 
merase and BSA additions were made) and 5 min at 70°C. The segments of each subsequent PCR cycle were 
the following: 1 min at 93«C, 1 min at 50°C and 5 min at 70X. After 0, 13, 23 and 40 cycles, 20 ^1 amounts of 
100 |il volumes were removed and subjected to agarose gel electrophoresis with ethidium bromide present to 

40 quantitate the amplification of the 1 kb DNA sequence. 

Initial experiments with this target DNA sequence present at 28 ng and 2.8 ng established the ability of the 
T. litoralis DNA polymerase to catalyze the polymerase chain reaction; yields were comparable or not more than 
twofold greater than the seen with T. aquaticus DNA polymerase. 

However, it was at the lower levels of target DNA sequence, 2.8 femtograms, that differences in polymerase 

45 ' function were most apparent Under these conditions requiring maximal polymerase stability and/or efficiency 
at elongation of DNA during each cycle, the T. litoralis DNA polymerase produced greater than fourfold more 
amplified DNA than that of T. aquaticus DNA polymerase within 23 cycles. 

This ability to amplify very small amounts of DNA with fewer cycles is important for many applications of 
PCR since employing large cycle numbers for amplification is associated with the generation of undesirable 

50 artifacts during the PCR process. 

EXAMPLE X 

PURIFICATION OF RECOMBINANT 7: LITORALIS INTRON-ENCODED ENDONUCLEASE 

55 

6, CO// NEB671 (ATCC No. 68447), grown as described in Example IV, were thawed (70 grams) and sus- 
pended in Buffer A containing 200 ng of lysozyme per ml to a final volume of 300 ml. The mixture was incubated 
at 37«C for 2 minutes and then 75°C for 30 minutes. The heated mixture was centrifuged at 22,00 x g for 30 
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minutes and the supernatant was collected for further purification of the thermostable endonuclease. Since all 
of the nucleases from E. co// were inactivated by the heat treatment, the preparation at this stage could be used 
for characterization of the intron-encoded endonuclease. To separate this enzyme from the recombinant T. //- 
toralis DNA polymerase also present in the 75^ supernatant solution, the solution was passed through a 

5 DEAE-sepharose column (5 cm x 5 cm, 100 mt bed volume) and washed with 200 ml of Buffer A. Essentially 
all of the DNA polymerase activity passes through the column while the endonuclease activity sticks. The en- 
donuclease activity was eluted with a one liter linear gradient of NaCt from 0.1 M to 0.8 M formed in Buffer A. 
The endonuclease activity eluted at about 0.4 M NaCI, and was assayed in a buffer containing 10 mM KCI, 20 
mM Tris-HCI (pH 8.8 at 24°C), 10 mM (NH4)4S04, 10 mM MgS04, 0.1% Triton X-100 and 1 ng of pBR322 DNA 

10 per 0.05 ml of reaction mixture. The reaction mixture was incubated at 75**C and the extent of DNA cleavage 
was detenmined by agarose gel electrophorese. At lower temperatures little or no endonuclease activity was 
detected. The tubes containing the peak activity were pooled, dialyzed ovemight against Buffer A and then ap- 
plied to phosphoceltulose column (2.5 cm x 6.5 cm, 32 m) bed volume), washed with Buffer A and the endonu- 
clease activity eluted with a linear gradient of Nad from 0.1 M to 1.5 M formed in Buffer A. The enzyme eluted 

15 at about 0.8 M NaCt. Active fractions were pooled and dialyzed overnight against Buffer A and then passed 
through a HPLC Mono-S column (Pharmacia) and eluted with a linear gradient of NaCI from 0.05 M to 1.0 M. 
The activity eluted as a single peak and was homogeneous by SDS-PAGE: a single 42-47 kd band was detected 
by Commaste blue staining and when this band was eluted from the gel and renatured it contained the only 
endonuclease activity detected on the gel. 

20 The enzyme has preferred cutting sites on various DNAs. When used in vast excess and in Vent polymer- 

ase buffer (New England Biolabs, Beverly, MA), the enzyme has cutting sites on lambda DNA and 3 sites on 
pBR322. Two of the rapid sites on pBR322 have been sequenced: 
Region including cut site at position 164: 

25 

5' TTGGTTATGCCGGTAC TGCCGGCCTCTT 3' 
3' AACCAATACGGC CATGACGGCCGGAGAA 5' 

30 

Region including cut site at position 241 1 : 



5' TTGAGTGAGCTGATAC CGCTCGCCGCAG 3' 

35 

3' AACTCACTCGAC TATGGCGAGCGGCGTC 5' 

When IVS2 was deleted from pPR969. the resultant plasmid, pAKK4 (Example XI) now contains a very 
sensitive fast site at the exon junction: 
40 Region including the cut site at IVS2 junction: 

5 ' GGTTCTTTATGCGGAC*AC /TGACGGCTTTATG 3 ' 
3 ' CCAAGAAATACGCC/TG*TGACTGCCGAAATAC 5 ' 

45 ' 

The astericks denote the boundary between the left exon and the right exon which have been brought to- 
gether by deletion of IVS2. 

Cleavage at the l-TIi I homing site occurs 100-fold more rapidly that at the "gtar" sites using reaction con- 
ditions of 50 mM TRIS. (pH 7,9), 10 mM MgClj, lOOmM NaCI and 1 mM DTT at 50°C. Under these conditions, 
50 the enzyme cut E. coli DNA 6-10 times. "Star" cleavage is enhance by NH4 (1 0 mM), higher temperatures (70- 
80^C), and higher pH (8.8-10). 

Thus, the endonuclease from T. litoralis resembles other intron encoded endonucleases reported in that 
there is often a four base 3' extension at the cut site and there can be degeneracy in the recognition sequence. 

The cut site In the intron minus gene is referred to as the homing site of the intron encoded endonuclease. 
55 It is believed in the art that the intron encoded endonuclease recognizes its cut site in the gene lacking the 
intron, and that the cutting of that DNA by the endonuclease leads to insertion of the intron at the homing site. 

The thermostable endonuclease of the present invention can be used in genetic manipulation techniques 
where such activity is desired. 



28 



EP 0 547 920 A2 

EXAMPLE XI 

Construction of 7: litoralis DNA Polymerase Expression Vectors with a Deleted IVS2 



Analysis of the deduced amino acid sequence of the 7! litoralis gene in comparison to other alpha class 
DNA polymerases and to the endonuclease in the 1170 bp intervening sequence suggested that this intron in- 
terrupted the alpha polymerase Region I. If the first 3 amino acids preceding the endonuclease (Tyr Ala Asp) 
were joined to the Thr at aa 1472, then a good consensus Region I would be established (where underlined 
residues indicate identity): 

Region I: TYR GLY AS? THR ASP ScR 

Left junction: TYR ALA ASP SER VAL S£R 

Right junction: VAL HIS ASM IHR AS? GLY 

Vent Pol Region I: lYR ALA AS? IHR AS? GLY 



To facilitate this construction, a Seal site was created in the PGR primers by changing the codon usage 
for Lys 1076 and Val 1077 as follows: 



Amino acids: 


PHE 


LYS 


VAL 


LEU TYR ALA ASP 


Original sequence: 


TTT 


AAG 


GTT 


CTT 


Altered sequence: 


TTT 


AAA 


GTA 


CTT 


Sea I site: 




A 


GTA 


CT 



The expression plasmid pAKK4 was created in a three-way ligation derived from the following components: 

1) An about 7959 bp fragment of pPR969 was derived by cleavage with Hindlll and EcoRI. 9ng of pPR969 
DNA was incubated with IX NEBuffer 2 in a total volume of 0.1 ml with 40 units of Hindlll endonuclease 
and 40 units of EcoRI endonuclease for 1 hour at 37*^0. Cleavage products were separated on a 0.7% GTG 
grade agarose gel (FMC) run in Tris Borate EDTA buffer. The appropriate band, about 8 kbp, was isolated 
by electroelution using an Elutrap eiution apparatus (Schleicher and Schuell) using the manufacturer's rec- 
ommended running conditions. Following eiution, the fragment was concentrated by ethanol precipitation 
and the recovery quantified by comparison with known weight standards on agarose gel electrophoresis. 

2) An about 638 bp fragment with Seal and EcoRI tenmini derived from a PGR product. The reaction mixture 
contained 1 X NEB Vent Polymerase Buffer, 0.1 mg/ml bovine serum alumen, 0.2 mM dNTPs (equimolar, 
each nucleotide), 0.9 \xglm\ pV174-1B1 plasmid DNA template, and 0.01 AseoU/ml of primer 72-150 
(5'ATAAAGTACTTTAAAGCCGAACTTTTCCTCTA3') and primer "JACK" (5'CGGCGCATATGATACTGGA- 
CACTGATTAC3'). 0.1 ml of the reaction mix was placed into each of five tubes, and the samples heated 
to 95°C for 3-5 minutes in a Perkin- Elmer Thermocycler. 1 U of Vent DNA polymerase was added to each 
reaction tube, and 15 cycles were run on the thenmocycler consisting of 94° C- 0.5 minutes. 50°C - 0.5 
minutes, and 72**C - 2 minutes. The samples were pooled, phenol extracted and ethanol precipitated. The 
sample was resuspended in 50 ^1 Tris-EDTA buffer and mixed with 40 nl of dHjO, 10 iil of 10X NEBuffer 
3, 60 units of Seal endonuclease and 60 units of EcoRI endonuclease. After incubation at 37°C for 1 .75 h, 
the reaction products were separated on a 1.5% agarose gel and the ca. 638 bp fragment was electroeluted, 
and quantified as described above. 

3) An about 358 bp fragment with Hindlll and Seal temnini derived from a PCR product. The reaction mixture 
contained 1 X NEB Vent Polymerase Buffer, 0.1 mg/ml bovine serum albumin, 0.2 mM dNTPs (equimolar, 
each nucleotide), 0.9 ^g/ml pV174-1B1 plasmid DNA template, and 0.02 A26o/mI of primer 698 (5'GA- 
GACTCGCGGAGAAACTTGGACT3') and primer 73-143 (5TACAGTACTTTATGCGGACACT- 
GACGGCi 1 I 1ATGCCAC3'). 0,1 ml of the reaction mix was placed into each of five tubes, and the samples 
heated to 95°C for 3-5 minutes in a Perkin-Elmer Thermocycler. 1 U of Vent DNA polymerase was added 
to each reaction tube, and 20 cycles were run on the thermocycler consisting of 94**C - 0.5 minutes, 50°C 
- 0.5 minutes, and 72*'C-1 minute. The samples were pooled, phenol extracted and ethanol precipitated. 
The sample was resuspended in 50 ^il Tris-EDTA buffer and cleaved with Hindlll and Seal endonucleases. 
The reaction products were separated on a 1.5% agarose gel and the 358 bp fragment was electroeluted, 
and quantified as described above. 

The ligation reaction contained approximately 1 ng/ml of the pPR969 fragment described above, 0.8 ng/ml 
of the 638 bp fragment described above, 0.4 ^g/ml of the 358 bp fragment described above, 1X NEB ligation 
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buffer and 100,000 units/ml T4 DNAIigase. Ligation occured at 16°C for 5 hours Correctly constructed recom- 
binants were identified by the Seal digestion pattern, and transfonmed into BI^1(DE3) plysS to screen for in- 
ducible activity, as described above. Two such isolates, pAKK4 and pAKK15 were used in subsequent studies. 
These two isolates appear to be identical, although they were isolated from independent isolates. 

5 Expression from the new construct pAKK4 appears to yield 3-10-fold more active T. litoralis DNA polymer- 

ase than pPR969 without expression of the endonuclease from the 1170 bp intron. 

An expression vector for production of the exonudease deficient variant of the 7". litoralis polymerase was 
constructed by replacing a 1417 bp Ctal-SphI fragment from pAKK15 with an analogous 1417 bp fragmentfrom 
pCBAI, the original exonuclease-deficient T. litoralis DNA polymerase construct. One such recombinant was 

10 named pAKM8 and was characterized further. 

EXAMPLE XII 

PURIFICATION OF A THERMOSTABLE DNA POLYMERASE FROM PYROCOCCUS SPECIES 

15 " ' ' 

Pyrococcus sp. strain GB-D (ATCC No. 55239) was grown in the media described by Belkin, et al., supra, 
containing 10 g/l of elemental sulfur in 8 one liter bottles at 94^0 for two days. The cells were cooled to room 
temperature, separated from unused sulfur by decanting and collected by centrifugation and stored at -70^0. 
The yield of cells was 1 .4 g per liter. 

20 11.5 g of cells obtained as described above, were suspended in 28 ml of buffer A (10 mM KP04 buffer, pH 

7.4; 0.1 mM EDTA, 1.0 mM beta-mercaptoethanol) containing 0.1 M NaCI and sonicated for 5 minutes at4°C. 
The lysate was centrifuged at 15,000 g for 30 minutes at4°C. The supernatant solution was passed through 
a 18 ml Affigel blue column (Biorad). The column was then washed with 50 ml of buffer A containing 0.1 M NaCI. 
The column was eluted with a 300 ml linear gradient from 0.1 to 2.0 M NaCI in buffer A. The DNA polymerase 

25 eluted as a single peak at approximately 1.3 M NaCI and represented 90% of the activity applied. The peak 
activity of DNA polymerase (25 ml) was dialyzed against 1 liter of buffer A containing 100 mM NaCI, and then 
applied to 15 ml Phosphocellulose column, equilibrated with buffer A containing 100 mM NaCI. The column was 
washed with 50 ml of buffer A containing 100 mM NaCI, and the enzyme activity was eluted with 200 ml linear 
gradient of 0.1 to 1.0 M NaCI in buffer A, The activity eluted as a single peak at 0.6 M NaCI and represented 

30 70% of the activity applied. The pooled activity (42 ml) was dialyzed against 500 ml of buffer A and applied to 
a 25 ml DEAE column. The column was washed with 50 ml of buffer A containing 0.1 M NaCt, and two-thirds 
of the enzyme activity passed through the column. The active fractions were pooled (30 mf) and applied to an 
1.0 ml HPLC mono-S column (Pharmacia) and eluted with a 100 ml linear gradient in buffer Afrom 0.05 to 1.0 
M NaCI. The activity eluted as a single peak at 0.22 M NaCI and represented 80% of the activity applied. 

35 Purified Pyrococcus sp. polymerase was electrophoresed in SDS 10-20% polyacrylamide gel and stained 

with either Coomassie Blue or the colloidal stain (ISS Problue) previously described to detect protein. A faintly 
staining protein band was seen at about 92,000 to 97,000 daltons; this molecular weight detemninatlon was 
obtained by comparison on the same gel to the migration of the following marker proteins (Bethesda Research 
Laboratories): myosin, 200,000 daltons; phosphorylase B, 97,400 daltons; BSA. 68,000 daltons; ovalbumin, 

40 43,000 daltons, carbonic anhydrase 29,000 daltons; b-!actoglobuIin, 18,400 daltons; lysoyzme 14,300 daltons. 

EXAMPLE XIII 

CLONING OF PYROCOCCUS SPECIES DNA POLYMERASE GENE 

45 ' 

Cross hybridization of a Pyrococcus genomic DNA library using radioactive probes prepared from the DNA 
polymerase gene of T. litoralis allowed for the identification and isolation of a DNA encoding the Pyrococcus 
DNA polymerase. This was accomplished as set forth below. 

In order to detenmine which restriction enzymes would be most useful in preparation of the Pyrococcus gen- 

50 omic library, Pyrococcus sp. DNA was cut to completion with Eco Rl, BamHI and Hindi II. This DNA was subject 
to agarose gel electrophoresis (Figure 13A) and Southern hybridization (Figure 13B) using a DNA probe pre- 
pared as follows. A reaction mixture containing 1 ng of the first EcoRI fragment of the T. litoralis DNA polymerase 
gene (bp 1-1274, obtainable from bacteriophage NEB#618, ATCC No. 40794) as a template in a commercial 
random priming kit (New England Biolabs, Inc.) was incubated for 1 hour at 37^0 to produce a DNA probe of 

55 high specific activity. The probe was hybridized to Pyrococcus sp. DNA prepared above under moderately strin- 
gent conditions (Hybridization: overnight at 50°C, 4X SET, 0.1M sodium phosphate, pH 7, 0.1% Na pyrophos- 
phate, 0.1% SDS. 1X Denhardts solution; Wash Conditions: wash 3X20-30 min. 45<*C, 0.1XSET, 0.1 M sodium 
phosphate, (pH 7), 0.1% Na pyrophosphate, 0.1% SDS. Maniatis, et al., supra). A single major band at about 
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5 Kb was detected in BamH I cut Pyrococcus DNA. EcoR I and Hind III gave multiple bands with this probe, 
indicating that these enzymes cut within the Pyrococcus polymerase gene. 

Based on the results, a BamHI genomic library was constructed using the phage vector XDASH (Strata- 
gene). Partial and complete BamHI digests of Pyrococcus DNA were prepared. A mixture of the partial and com- 

5 pletely BamHI digested DNA was ligated into the BamHI site of XDASH. The ligation mixture was packaged 
using Gigapack Gold (Stratagene) according to manufacturer's instructions and plated on E. coli ER1458. The 
packaged phage library contained 1 x 10^ phage per ml. 

32P-labelled DNA probes of the 3 fragments (bp 1-1274. 1 656-2660 and 3069-3737) of the T. titoralis DNA 
polymerase gene (obtainable from NEB#61 9, ATCC No. 40795) were prepared using a random primer kit (New 

10 England Biolabs, Inc.), The probes were used according to the method of Benton & Davis (Maniatis, et al. supra) 
to screen the Pyrococcus genomic library using hybridization conditions described above. About one per cent 
of the plaques were positive and ten positive plaques were picked and purified by reinfection and replating 3 
times (until 90-100% of the plaques were positive for each isolate). Large amounts of phage were prepared 
from each isolate and used to infect E. co// cultures. Specifically, plate lysates (Maniatis et al., supra) of phage 

15 were prepared from each isolate and used to infect E. coli cells. 0.1 ml of each plate lysate was mixed with E. 
CO// with 0.2 ml of celts (OD6oo=2). The bacterial cells were harvested just before lysis and suspended in 0,05 
M NaCI, 0.01 M Tris (pH 8.0), 0.1 mM EDTA, 0.1% Triton X-100 and 200 i^g/ml lysozyme (3 volumes pervolume 
of cells) and heated to 37**C for about 1 minute or until cell lysis occured. The lysed extracts were immediately 
heated at 75°C for 30 minutes, centrifuged and the supernatant solution assayed for heat stable DNA polymer- 

20 ase activity, according to the method described above. Three of the ten isolates showed significant polymerase 
activity and the clone (B9) showing the most activity was investigated further. 

The phage DNA was isolated from B9 and the insert DNA was examined by restriction enzyme digestion. 
Digestion with Sal I gave the expected two amis of XDASH plus a 15 Kb insert. Digestion with BamH I gave 
the two anms of XDASH plus three insert fragments of 7, 4.8 and 3 Kb. Each of these fragments were purified 

25 by agarose gel electrophoresis, eluted and ligated into the BamH I site of pUCI 9. The ligation mixture was used 
to transfomn E. coli ER2207 which gives white colonies when plasmids contain an insert and blue colonies with 
no inserts on indicator agar media (X-gal plus IPTG). No white transformants were obtained with the 7 Kb frag- 
ment Three whites and twenty-seven blue transformants were obtained with the 4.8 Kb fragment and twenty 
white and twenty-one blue transformants were obtained with the 3 Kb fragment All three 4.8 Kb white colony 

30 transfonmants expressed heat stable DNA polymerase activity. None of the transformants with the 3 Kb frag- 
ment expressed heat stable polymerase activity. The three dones carrying the 4.8 Kb Pyrococcus DNA frag- 
ment all had about the same specific activity for heat stable DNA polymerase and one was picked for further 
study (NEB#720). This done designated NEB#720 was deposited with the American Type Culture Collection 
on October 1, 1991 and bears ATCC No. 68723. A restriction endonuclease map of the 4.8 Kb BamH I fragment 

35 containing the Pyrococcus sp. DNA polymerase gene is shown in Figure 14. A partial DNA nucleotide sequence 
coding for Pyrococcus sp, DNA polymerase (NEB720) is set forth in Figure 18. induding the start of the poly- 
merase gene at bp 363 and a portion of the intervening nudeotide sequence (bp 1839-3420). NEB#720 yielded 
1700 units of DNA polymerase activity per gram of cells and was used for the large scale preparation of this 
enzyme. 

40 A portion of the Pyrococcus sp. DNA polymerase clone has been sequenced (Fig. 18, bp 1-3420). The se- 

quence of the Pyrococcus sp. DNA polymerase is very similar to the T. litoralis DNA polymerase at both the 
DNA and protein level (similarity calculated using the GCG Bestfit Program, Smith and Waterman, Advances 
in Applied Mathmatics, 2:482 (1981)). Overall, the genes are 66% identical, with 69% identity in the mature 
DNA polymerase amino termini regions (bp 363-1838 in Pyrococcus sp, DNA polymerase) and 63% identical 

45 ' in the portion of IVS1 sequenced to date (bp 1839-3420 in Pyrococcus sp. DNA polymerase). The upstream 
regions (bp 1-362 in Pyrococcus sp. DNA polymerase, Fig. 18 and bp 1-290 in 7". litoralis DNA polymerase. 
Fig. 6) show no similarity according to the Bestfit Program. 

Similarity at the protein level is even higher. In the 1019 amino acid Pyrococcus sp. DNA polymerase coding 
region, the two polymerases have 83% similarity and 68% identity (Fig. 19). When broken down into the mature 

50 polymerase amino terminus and IVS1, the polymerase coding exons are more similar than the intervening se- 
quence, with the mature polymerase amino termini (aa 1-492 in Pyrococcus sp. DNA polymerase) being 89% 
similar, and 78% identical, and IVS1 (aa 493-1019 in Pyrococcus sp. DNA polymerase) being 78% similar and 
60% identical. 

55 
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EXAMPLE XIV 

ARCHAEBACTERIA DNA POLYMERASE SIMILARITIES AT THE DNA LEVEL 

5 The degree of cross-hybridization between the T IHoralis DNA polymerase gene and the DNA polymerase 

genes from 3 other thermophylllc archaebacteria and from Taq DNA was assessed by Southern blot hybridi- 
zation (Maniatis, supra). Chromosomal DNA from T. litoralis and Pyrococcus sp. (Strain GB-D). T. aquaticus, 
and two other Pyrococcus strains, G-1-J and G-1-H, were cleaved with either EcoRI or BamHI. 5 ^g of each 
DNA was incubated with 1x NEBuffer (EcoRI buffer for EcoRI endonudease and BamHI buffer + 1X BSA for 

10 BamHI endonudease) in a total volume of 60 [il with 20 units of EcoRI endonudease or 20 units of BamHI en- 
donudease for 2 hours at 37°C. Four quadmplicate 0.75^g samples of each of the deaved DNAs were loaded 
and run on a 1% agarose (SeaKem LE) gel in Tris Acetate EDTA buffer (Maniatis, supra). The gel was stained 
with Ethidium Bromide (t^igt/m/) for 20 minutes at room temperature and a photograph taken with a ruler besides 
the gel. 

15 The DNA was transferred from the gel onto nitrocellulose paper using the method developed by Southem 

(Maniatis supra). Nitrocellulose filter paper (0.45\Wfi) was cut to the size of the gel and soaked in 200ml of 6x 
SSC (0.9M NaCI, 0.09M Sodium Citrate) for greater than 1 hour at 37°C. Meanwhile, the gel was incubated 
for 15 minutes in 200 ml of 0.25M Hydrochloric acid at room temperature, then rinsed with distilled water. The 
gel was then incubated for 30 minutes in 200 ml 0,5M Sodium Hydroxide, 1M Sodium Chloride at room tenv 

20 perature, then rinsed with distilled water. The gel was then incubated for 30 minutes in 200-mis 1M Tris HCI, 
pH7.5, 3M Sodium Chloride at room temperature. Transfer of the DNA from the gel onto the nitrocellulose was 
carried out at 4'»C in 18X SSC (2.7M Sodium Chloride, 0.27M Sodium Citrate). 1M Ammonium Acetate. After 
6 hours the nitrocellulose was removed and washed in 1x SSC (0.1 5M Sodium Chloride and 0.01 5M Sodium 
Citrate) for 30 seconds. The nitrocellulose filter was air dried and then vacuum dried at 80"*C for a further 2 

25 hours and then stored at room temperature. 

Four gel purified fragments of T. fitoralis DNA polymerase DNA, (1.3 kb Eco Rl fragment from bp 1-1274 
representing the 5' polymerase coding region; bp 471 8-5437, representing the 3* polymerase coding region; 
bp 2448-2882, representing part of IVS1 ; and bp 3656-4242, representing part of IVS2, Figures 6 and 1 5) were 
radiolabelled using the New England Biolabs Random Primer Kit. lOOng of the above template DNAs, each in 

30 a volume of 35.5 ^1, were boiled for 5 minutes in a boiling water bath and then cooled on ice for 5 minutes and 
spun down. The template DNAs were incubated with IX labelling buffer (includes random hexanucleotides), 
1/10 volume dNTP mix, 25nCi a^^p dCTP and 5 units DNA Polymerase l-Klenow fragment in a total volume of 
50|il for 1 hour at 37°C. The reactions were stopped with 0.01 8M EDTA. The probes were purified using an 
Elutip minicolumn (Schleicher and Schuell) following the manufacturers recommended elution conditions. The 

35 total number of counts were calculated for all purified probes. The 1.3 kb Eco Rl fragment probe (bp 1-1274) 
yielded 24 x lO^cpm, the 3' polymerase probe (bp 4718-5436) yielded 22 x lO^cpm, the IVS1 probe yielded 
54 X lO^cpm, and the IVS2 probe yielded 47 x lO^cpm. 

Hybridization was carried out as follows (Maniatis supra). The nitrocellulose filter was incubated for 30 min- 
utes in 5mls prehybridization buffer (0.75M Sodium Chloride, 0.15M Tris, 10 mM EDTA, 0.1% Sodium Pyro- 

40 phosphate, 0.1% Sodium Lauryl Sulphate, 0.2% Bovine Serum Albumin, 0.2% Ficoll 400, 0.2% PVP and 100 
fig/ml boiled calf thymus DNA) at 50°C. Each nitrocellulose filter was then placed in separate bags with 5mls 
hybridation buffer (as above except 0.03% Bovine serum albumin, 0.03% Ficoll 400, and 0.03% PVP). Each 
section was hybridized with 22-25 x lO^cpm of denatured probe overnight at 50°C. 

The nitrocellulose filters were removed from the bags and incubated 3 x 30 minutes with 0.1X SET Wash 

45 ' {15mM NaCI, 3mM Tris base, 0.2 mM EDTA, 0.1% SDS, 0.1% Sodium Pyrophosphate and 0.1M Phosphate 
Buffer) at 45°C. The filters were kept moist, wrapped in Saran Wrap and exposed to X-ray film for various times 
ranging from 4 hours to 3 days. 

The results are shown in Figure 16. In Figure 16, parts A through D are autoradiographs of quadruplicate 
Southern blots. Lanes 1-5, DNA cut with EcoRI. Lanes 6-10, DNA cut with BamHI. Lanes 1 & 6, Pyrococcus 

50 sp. G-1-J DNA; Lanes 2 and 7, Pyrococcus G-1-H DNA; Lanes 3 & 8, T. litoralis DNA; Lanes 4 and 9, Pyrococcus 
sp. GB-D DNA, Lanes 5 & 10, 1 aquaticus DNA. The hybridization probes are as follows: part A, 5' coding region 
of T. litoralis DNA polymerase gene, bp 1-1274; part B, 3' coding region of T. Irtoralis DNA polymerase gene, 
bp 4718-5437; Part C, partial IVS2 probe, bp 3666-4242; Part D, partial IVS1 probe, bp 2448-2882. The upper 
and lower panels of parts C and D represent shorter and longer exposures, respectfully, of the same blots. 

55 None of the 4 probes hybridized to Taq DNA. Both polymerase coding region probes hybridize to specific 

bands in all Thermococcus and Pyrococcus DNAs, but not Taq DNA. Good signals were obtained with both 
probes indicating strong conservation of both the amino and carboxy terminal ends of the T. litoralis DNA Poly- 
merase coding region. The amino temiinal regions of T. litoralis and Pyrococcus sp. GB-D are about 69% iden- 
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tical (see, e.g. Figs, 6 and 18) and very similar at the protein level (Fig. 19). The IVS1 probe hybridized strongly 
to r. litoralis and Pyrococcus sp. GB-D DMAs (about 63% identical over a 1 582 bp region) and weakly to Pyr- 
ococcus sp. G-1-H DNA. The IVS2 probe hybridized strongly to 7. tftoralis DNA and weakly to Pyrococcus sp. 
G-1-H DNA. 

5 

EXAMPLE XV 

ARCHAEBACTERIA DNA POLYMERASE SIMILARITIES AT THE ANTIBODY LEVEL 

10 Pellets fronn 1 ml cultures of T. litoralis, and Pyrococcus strains were resuspended in 1 00^1 Urea lysis buffer 

(4M Urea. 0.12M Tris, 4% Sodium Lauryl Sulphate. 10% p-mercaptoethanol, 20% glycerol and 0.002% Bro- 
mophenol Blue) and boiled for 3 minutes. The boiled samples were sheared with 25G5/8 needle to reduce the 
viscosity of the samples. Duplicate 10ial samples of T. litoralis, and Pyrococcus strains G-1-J and G-1-H, and 
also samples of purified Taq DNA polymerase, E. coli DNA polymerase and purified DNA polymerase from Pyr- 

15 ococcus sp. (GB-D) were loaded onto 10-20% SDS-PAGE gels and ain in Protein Running Buffer (0.1% Sodium 
Lauryl Sulphate, 0.19M Glycine, and 0.025M Tris Base). Nitrocellulose filters (45^m) were soaked in distilled 
water for 5 minutes and then soaked in Transfer buffer (0.15% ethanolamine, 20 mM Glycine and 20% Metha- 
nol) for 30 minutes. The protein on the gels were electroeluted (30 volts, ovemight at 4^*0) onto the nitrocellulose 
filters in Transfer buffer (Towbin, et al. PNAS (1979) 76:4350-4354). 

20 The nitrocellulose was removed, marked with a ball point pen and washed for 5 minutes in TBSTT (20 mM 

Tris, 1 50mM Sodium Chloride, 0.2% Tween 20, and 0,05% Triton X-1 00). The filters were blocked for 30 minutes 
in TBSTT + 3% nonfat dry milk (Carnation), and washed 3x3 minutes in TBSTT. The anti-T! litoralis DNA poly- 
merase antisera was raised against a partially purified native DNA polymerase preparation. 7^ litoralis DNA poly- 
merase specific sera was prepared by affinity purification on Western blot strips of purified native enzyme (Beall 

25 et al., J. Immunological Met tiods86:2U -233 (1983)). Affinity purified anti-T. litoralis DNA polymerase mouse 
antibody (V76-2+3) and monoclonal anti-Taq polymerase antibody (diluted 1:100 in TBSTT) were added sep- 
arately to each nitrocellulose filter for 5 hours at room temperature. The filters were washed 3x3 minutes with 
TBSTT and then reacted with a 1:7500 dilution of anti-mousse secondary antibody conjugated with alkaline 
phosphatase (Promega) in TBSTT for 1 hour at room temperature. The nitrocellulose filter was developed with 

30 NBT/BCIP as instructed by the manufacturers (Promega). The results using Taq monoclonal are shown in Fig- 
ure 17. Figure 17 is a Western blot of crude lysates from T. litoralis (V), Pyrococcus sp. G-1-J (J), and Pyro- 
coccus sp. G-1-H (H), or purified polymerases from Pyrococcus sp. GB-D (DV), T. aquaticus (T) or E. coli (E) 
reacted with affinity purified anti-I litoralis DNA Polymerase antibody in Part A or anti-Taq DNA polymerase 
monoclonal antibody in Part B. The arrow indicates the position of the T. litoralis and Pyrococcus sp. DNA Poly- 

35 merase proteins. The reactivity in Part B is to background proteins and not to the DNA polymerases as seen 
in part A. 

Monoclonal antibody specific to Taq DNA polymerase does not cross-react with protein fonm the Pyrococ- 
cus and Thermococcus strains tested. 

However, the 90-95,000 dalton DNA polymerase proteins from T. litoralis and the 3 Pyrococcus strains re- 
40 acted with the affinity purified anti-I litoralis DNA polymerase antibody. This is not surprising, considering the 
high degree of both similarity and identity between T. litoralis and Pyrococcus sp. GB-D DNA polymerases (Fig. 
19), 

Figure 19 is a comparison of a portion of the deduced amino acid sequences of recombinant 7^ litoralis 
and the partial sequence of recombinant Pyrococcus sp. DNA polymerase. The Pyrococcus DNA polymerase 
45 ' deduced amino acid is listed on the upper line, and the deduced amino acid sequence of recombinant T. litoralis 
DNA polymerase is listed on the lower line. Identities are indicated by vertical lines, similariteies are indicated 
by 1 or 2 dots, nonconserved substitutions are indicated by blank spaces between the two sequences. 

EXAMPLE XVI 

50 

In order to obtain recombinant thenmostable DNA polymerase from a target archaebacterium, several basic 
approaches to cloning the target DNA polymerase gene can be followed. Initially, one attempts to determine 
immunologically whether the new polymerase is a member of the Pol a or Pol I family by Western blot analysis 
of purified polymerase (although crude polymerase lysates may work with reduced sensitivity) using anti-Taq 
55 DNA polymerase or anti- t: litoralis DNA polymerase sera, as described in Example XV of this invention (Figure 
17). If the new polymerase reacts with anti-Taq Polymerase monoclonal, then it probably cannot be easily 
doned using reagents generated from T. litoralis DNA Polymerase. If the new polymerase cross-reacts with 
anti- 7! litoralis sera, then one should be able to clone it with the following procedures. If the new polymerase 
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fails to react with either sera, then the experiment is considered inconclusive and one should go onto the next 
step, DNA cross-hybridization. 

Optimum probes and DNA hybridization conditions must be experimentally determined for each new or- 
ganism. At the same time, various restriction digests of DNA from the new organism are tested in order to find 
5 enzymes which yield fragments which hybridize to the 7". I'doratis probe and are large enough to encode the 
new polymerase. 

Probe selection can vary with respect to size and regions of the 7; litoralis DNA Polymerase gene. Optimum 
probes can be determined by performing test Southern blots as described below with large or small DNA frag- 
ments, or even oligomers. One could select probes that are totally from within the IVS sequences to look for 

10 the presence of IVSs in new archaebacterium DNA polymerase genes, or probes could be limited to mature 
polymerase coding regions. Using the entire 7". litoralis DNA Polymerase gene region as the probe has several 
advantages and disadvantages. The major disadvantage is that the larger the probe, the more likely to yield 
spurious hybridization at very low stringency. Among the advantages of using larger probes are (1) they are 
more likely to cross- hybridize to another polymerase which may have diverged gready from the I litoralis DNA 

15 Polymerase gene in one small portion of the polymerase, and (2) they are more likely to detect interna! restric- 
tion sites in the new polymerase gene since the probe spans the amino- and carboxy-termimi of the T. litoralis 
DNA Polymerase gene. It is important at the initial stages of probing to use several restriction enzymes to cleave 
the DNA from the new archaebacterium to find one or more enzymes which yield preferably one, or possibly 2 
bands, which hybridize to the T. litoralis DNA Polymerase probe and which are large enough to encode the 

20 new polymerase. The minimum coding sequence required for the new polymerase can be estimated from the 
size of the new polymerase detenmined by Western blots (assuming a factor for IVSs, if desired) or, by guessing 
at greater than 4 KB as a first approximation. Maximum fragment size is limited by the cloning capacity of the 
desired vector. 

Optimum hybridization conditions are experimentally detenmined by perfonming test Southern blots at va- 

25 rious wash temperatures. Hybridization is canried out at 50°C in 4X SET, 0.1M sodium phosphate, pH 7, 0.1% 
Na pynDphosphate, 0.1% SDS, IX Denhardts solution, although any low stringency hybridization condition 
would also be suitable (Maniatis). Wash conditions are varied from 37-55°C, 3 x 30 minutes with 0.1X SET 
wash (15mM NaCI, ZmM Tris base, O.lmM EDTA, 0.1% SDS. 0.1% Sodium Pyrophosphate and 0.1M Phos- 
phate Buffer), although any standard low stringency wash conditions can also be used. The point of this part 

30 of the experiment is to hybridize the probe and wash the Southern blot at low stringency to insure some level 
of cross-hybridization which may even include non-specific cross-hybridization. Next, one increases the wash 
stringency, for example, increasing the wash temperature in 3-5*'C increments and then monitoring the disap- 
pearance of hybridized probe as determined by a decrease in signal upon autoradiography. Initially, one ex- 
pects to see many bands hybridizing to the probe at low stringency. As the wash stringency increases, weakly 

35 hybridizing sequences melt off and disappear from the autoradiograph. As wash stringency Is increased, con- 
ditions are established at which only one or a few bands still hybridize to the probe. These are the conditions 
to be used in future experiments. As stringency increases beyond this point, all hybridization signal is lost. The 
goal is to determine the most stringent condition where one or a few bands per digest still hybridize to the probe 
before all hybridization signal is lost. 

40 If initial probing with a large T. litoralis DNA polymerase gene fragment fails to give a clear pattern using 

any hybridization conditions, then smaller probes can be tested until a good partnership of probe size and hy- 
bridization conditions are established. Alternatively, Example XIV of the present invention shows that several 
fragments spanning different regions of the 7: litoralis DNA polymerase gene (amino terminus, IVS1. IVS2 and 
carboxy terminus. Figures 1 5 and 16)) can be used in separate Southern blots, but tested in parallel at the same 

45 ' time. 

Libraries are constnjcted with the optimum restriction digests and hybridized with the optimized probe. A 
parallel approach is to clone in expression vectors and directly screen with anti-7^ litoralis sera. Either primary 
approach may yield active or inactive product. If no active polymerase is detected, the clone is checked for 
insert size and reactivity to anti-T. litoralis sera. If there is no reactivity to anti-7: litoralis sera, then the poly- 

50 merase may not be expressed from its own control sequences in E. coli and the plasmid insert must be se- 
quenced to operably link the new polymerase to an E. co// promoter and perhaps translation signals. 

In the present invention, we have identified introns or intervening sequences in Pol a consen/ed region 
motifs in both T. litoralis and Pyrococcus sp. DNA polymerase genes. We therefore predict that other Archae 
DNA polymerase genes may have introns in conserved motifs also. If the new polymerase clone is inactive, it 

55 should be checked for the presence of intervening sequences. These introns can be identified in 2 ways. If these 
introns are related to introns found in T. litoralis and Pyrococcus sp. DNA polymerase genes, they can be iden- 
tified by low stringency hybridization to DNA probes derived from intron sequences of T. litoralis and Pyrococcus 
sp. DNA polymerase genes. If IVSs are found, the clone is sequenced to develop strategies for removal of the 
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IVS. If the done is inactive and no cross- hybridizing IVSs are found, then the plasmid is sequenced to look for 
new IVSs. The archaebacterium DNA polymerase gene can be sequenced at the DNA level and the sequenced 
compared to (1) other DNA polymerases to identify non-similar segments (2) conserved motifs to look for the 
absence of Regions l-VI, followed by identification of intenruption points in Regions which are absent. Once 
5 identified, introns can be removed in vitro by any number of techniques known in the art, some of which are 
described in this application with respect to to removal of IVS1 and IVS2 from the I litoralis DNA polymerase 
gene. 

If the primary library screening fails to produce a clone synthesizing active themnostable DNA polymerase, 
but does result in a partial gene clone as detemriined by (1) cross-hybridization at the DNA level, (2) cross- 
to reactivity at the antibody level, and (3) similarity to other DNA polymerases at the DNA sequence or deduced 
amino acid sequence levels, then more genomic Southern blots are probed with the initial clone to identify re- 
striction enzymes to be selected for making the next library. The second library should contain larger fragments 
which are more likely to encode the entire polymerase gene. The library is screened with either antibody or 
preferably, the initial new polymerase cloned sequence. The resultant positives are checked for thermostable 
15 DNA polymerase activity. If no active thenmostable DNA polymerase is detected in this second round, then in- 
tervening sequences can be screened for by cross-hybridization and DNA sequencing. DNA sequencing can 
also indicate whether the cloned gene is complete by establishing the presence of all the conserved polymerase 
motifs and a stop codon in the polymerase open reading frame. Several rounds of screening and rescreening 
may be necessary before finally cloning an active thenmostable DNA polymerase. 
20 It should also be noted that the above screening and rescreening procedure may not be sufficient for cloning 

the new thenmostable polymerase gene because of toxic elements present in the gene. In this case, cross- 
reactivity at the DNA or protein level is an excellent method of cloning because only partial, inactive products 
can initially be cloned which will allow subsequent cloning of the complete gene. If obtaining the complete gene 
is not straightforward using the strategy outiined above, one should look for the presence of intervening se- 
25 quences like IVS2 which are very toxic when cloned. This is accomplished by either looking for deletions and 
rearrangements in polymerase clones or by probing for known toxic 7^ litoralis IVS sequences. Duplicate South- 
ern blots are probed with polymerase coding regions and IVS sequences to locate toxic IVSs in proximity to 
tiie polymerase coding region. If rearrangements or toxic IVSs are found, then the appropriate sb^tegy would 
be to first operably link the amino terminal of the polymerase to a very tighUy controlled expression system as 
30 described in this present application. Once accomplished, the remainder of the polymerase gene can be cloned 
and ligated to the amino terminus, reducing expression of toxic elements such as the T. litoralis IVS2 sequence. 
Alternatively, cross-hybridizing sub-fragments of the polymerase gene can be isolated, checked for IVSs by 
hybridization or DNA sequencing, IVSs can be removed in vitro from these regions by methods known In the 
art. The complete polymerase gene can then be constructed by ligation of sub-fragments from which toxic ele- 
35 ments have been removed. 



Claims 

40 1. Recombinant thermostable DNA polymerase from archaebacteria which is encoded by a DNA sequence 
which hybridizes to a nucleotide sequence selected from the group consisting of the nucleotide sequence 
of Figure 6 or a portion thereof, nucleotides 1 to 1274 of Figure 6, nucleotides 1269 to 2856 of Figures 6 
and nucleotides 2851 to 4771 of Figure 6. 

45 ' 2. The recombinant thermostable polymerase of claim 1 , wherein the portion of the nucleotide sequence of 
Figure 6 is at least about 20 nucleotides in length. 

3. The recombinant thermostable polymerase of claim 1 , wherein the portion of the nucleotide sequence of 
Figure 6 is at least about 50 nucleotides in length. 

50 

4. The recombinant thermostable polymerase of claim 1 , wherein the portion of the nucleotide sequence of 
Figure 6 is at least about 150 nucleotides in length. 

5. Recombinant thermostable DNA polymerase from archaebacteria which hybridizes to an antibody probe 
which has antigenic specificity to T. litoralis DNA polymerase. 

55 

6. Isolated DNA which codes for the recombinant thenmostable DNA polymerase of daim 1 

7. A doning vector comprising the isolated DNA of claim 6. 
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8. A host cell transformed by the vector of claim 7. 

9. A method for producing a recombinant thermostable DNA polmerase from archaebacteria comprising cui- 
turing a host cell transformed with the vector of claim 7 under conditions suitable for the expression of the 
DNA polymerase. 

10. A DNA probe which hybridizes to the DNA sequence coding for the archaebacteria thermostable DNA poly- 
merase of claim 1 , wherein the DNA probe is selected from the group consisting of the nucleotide sequence 
of Figure 6 or a portion thereof, nucleotides 1 to 1274 of Figure 6, nucleotides 1269 to 2856 of Figure 6 
and nucleotides 2851 to 4771. 

11. The DNA probe of claim 10, wherein the portion of the nucleotide sequence of Figure 6 is at least about 
20 nucleotides in length. 

12. The DNA probe of claim 10, wherein the portion of the nucleotide sequence of Figure 6 is at least about 
50 nucleotides in length. 

13. The DNA probe of claim 10, wherein the portion of the nucleotide sequence of Figure 6 is at least about 
150 nucleotides in length. 

14. A method for isolating DNA coding for thenmostable DNA polymerase from archaebacterium comprising 
the steps of: 

(a) fonming a genomic library from the archaebacterium; 

(b) transforming or transfecting an appropriate host cell with the library of step (a); 

(c) contacting DNA from the transformed or transfected host cell with a DNA probe selected from the 
group consisting of the nucleotide sequence of Figure 6 or a portion thereof, nucleotides 1 to 1274 of 
Figure 6, nucleotides 1269 to 2856 of Figure 6 and nucleotides 2851 to 4771; 

(d) assaying the transfomied or transfected cell of step (c) which hybridizes to the DNA probe for DNA 
polymerase activity; and 

(e) isolating a DNA fragment which codes for the thermostable DNA polymerase. 

15. A method for isolating DNA coding for thennostable DNA polymerase from archaebacterium comprising 
the steps of: 

(a) fonning a genomic library from the archaebacterium; 

(b) transforming or transfecting an appropriate host cell with the library of step (a); 

(c) contacting extract from the transfomned or transfected host cell with an antibody probe which has 
specific affinity for 7^ litoralis DNA polymerase; 

(d) assaying the transformed or transfected cell of step (c) which is cross-reactive to the antibody probe 
for DNA polymerase activity; and 

(e) isolating a DNA fragment which codes for the thermostable DNA polymerase. 

16. A method for increasing the expression of a thenmostable DNA polymerase from archaebacteria compris- 
ing the steps of: 

(a) identifying and locating any intervening nucleotide sequence in the isolated DNA of claim 14 or 15; 
and 

(b) removing the intervening nucleotide sequence from the isolated DNA. 

17. The method of claim 16, wherein the archaebacteria comprises T. litoralis. 

18. The method of daim 17, wherein the intervening nucleotide sequence is selected from the group of IVS1, 
IVS2 or IVS1 and IVS2. 

19. The method of claim 16, wherein the intervening nucleotide sequence is identified and located with a DNA 
probe coding for an intron from the DNA sequence encoding T. litorafis DNA polymerase. 

20. The method of claim 19, wherein the DNA probe is selected from the group consisting of a 1614 bp nu- 
cleotide sequence of Figure 6 comprising nucleotides 1776 to 3389 or a portion therof and an 1170 bp 
nucleotide sequence of Figure 6 comprising nucleotides 3544 to 4703 or a portion thereof. 

21 . A thermostable endo nuclease obtainable from T. litoralis which cleaves double-stranded deoxy nucleotide 
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acid pBR322 at position 164 and 2411. 

The endonuclease of claim 21 , having a molecular weight of about 33,000-37,000. 
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Fig. 1A, SDS-Polyacrylamide Gel of Purified 
T, litoralis DNA Polymerase 



1 2 




Lane 1: Molecular weight markers 

Lane 2: Purined T. litoralis DNA Polymerase 
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MOLECULAR WEIGHT MARKERS 



SIZE DETERMINATION OF T. lltoralis DNA POLYMERASE 

FUNCTIONS 

FIG. IB 
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O THERMOCOCCUS litoralis 




MINUTES AT lOCC 



THERMAL STABILITIES OF DNA POLYMERASES 

F1G.3A 
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DNA POLYMERASES:!*) KLENOW FRAGMENT OF E. coli 

(A) T. litorolis 
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RESPONSE OF DNA POLYMERASES TO THE PRESENCE OR 
ABSENSE OF OEOXYNUCLEOTIOES 



FIG. 4 
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GAMTCGCGA TAAAATCEAT TTTCITCCIC GAmTTCAA TTTCAAAAAC GTRAGCATCA 60 

GOOyy^OCTC TOGOOTTIC TCTGrOCrTC OOGCIMCCC TCITCAAAAC TCICIOIAAA 120 

GcarriTrrG addgaamctc msciocict mcmggkza c7rMATCix;c aatc^^^ctitcg lao 

TCAAGGCTTA TICTCTAGAA CAAdOCMG MTITOGAIT TGGATOGGGG ™a?.WTT 240 

TGGOGGAACT TTTATTIMT TTGAACTOCA GTTmrATCr GGTCCTATTT AaX3AT7..CIt3G 300 

ACACTCATTA CAIMCAAAA GATCGCAAGC CTAI7\ATCCG AATTTTrMC AAAGr.G7^ 360 

GGGMTITAA AAIASAACIT GACCCrCATT TICAGOCCIA TATAXATCCT CTTCTCAAAG 420 

AraCTCDOCSC TATTGAGGM AXAAA3GCAA TAAAGGGOSA GAGACVTCGA AAAACTCTTCA 480 

GACTGCrOGA TCCAGTCAAA GTCAGGAAAA AAimTGGG AAGGGAAGIT GAAGICTGGA 540 

MCrCATTIT OC=^CC=\TOOC CMGACXTTTC CAGCTMraOG GGGCAAAATA AGGGT^J'.CATC 600 

CMcrc3TGCT Tcy^^rrrM cAMAimcA TAOcrriTGC CMjGCcjrrAT cirArr.GT^ seo 

AGGGCTTGAT TCOCATGGT-X; GGAGAOGAGG AGCTTMGCT CCTTGOCnT GATATTGAAA 720 

CCTTITATCA TGAGGGAGM- GMnTGGAA Pj3GGOG7>GAT AAIMTCATT AGITArC<:X2S 780 

ATCAMAAGA GGOCAGACTA AIC^CATGGA AAAATATCGA TITGCC3GTra GrcGATCTTG 840 

TCTOCAATGA AAGAGAAATG AXAAAGCCTT TTCTTCAACrr TGTTAAAGAA AAAG?..CCCCG 900 

ATCTGATAAT AACTTACAM' GGGGACAATT TTGAnTGCC GTATCTCATA AAACGGGCAG 960 

AAAAGCTGGG ACTTCGGCTT GTCITT^GGAA GGGACAAAGA ACATCCCGAA CCC7.;"^^T^C 1020 

AGAGGATGGG TGAlSOTlTr GCTGrTGGAAA TCAAGGGTAG AATCCACTIT GATCTTTTCC 1080 



FIG. 6-1 
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CMITCTGOG AAC3CaOGM3V AMCTOCCAA CC72M3VCGCr atS«3GCAGrr 


tatgaagcag 


1140 


1Tn3\GG;\AA AACCAAAAGC AflAriAGCTG CAGAGGAAAT TCCCGCI7\T7V 


TGGGAAACAG 


1200 


AAGAAAGmr GAAAAAACTA GCaZACTACT CAATGGAAGA TGCTRGGGCA 


ACXTTATGAGC 


1260 


'TOGGGAAGGA AITCTIOOOC ATCGAAGCTG AGCTCGCAAA GCTCATAGCT 


CAAACroTAT 


1320 


GGGAOSTCTC GAGATCAAGC AOXGCAACC TCCTGG?CTX3 CTMCTiriA 


AGGGTGGCAT 


1380 


ACGCGAGGAA TGAACTTGCA CC3GAAOAAC CTCAimGGA AGACTATAAA 


CGGCC-CTTAA 


1440 


GAACAACITA CCTGGGAGGA TAItTTAAAAG AGCCAGAAAA AGCTITCTCG 


GAAAATATCA 


1500 


riTATITGGA TTTCCGCAGT CTGrnCXXriT CAATTUVTACTr TACICACAAC 


GTATCCCCAG 


1560 


MACCCTTGA AAAAGAGGGC TCTTAAGAATT AOGATXnTCC TCCaATACTTA 


GGATTvTr^GGT 


1620 


TCXGCAAGGA CTTTCCCGGC TITATIOOCr CCATACTCGG GGACTTAArT 


GGAATGr^GGG 


1680 


AAGAIAIAAA GAAGAAAATC AAATOCACAA TTCACCCGAT CGAAAAGAAA 


ATGCTCGATT 


17 40 


MIAGGCAAAG GGCrATI7U\A TTGCITGCAA ACAGCATCTT ACCCAACGAG 


TGG37I7.CCAA 


1800 


TAATTGAAAA TGGAGAAATA AAATTCGTGA AAATTCGCGA GmATAAAC 


TCTTACATGG 


1860 


AAAAACAGAA GGAAAAOGTT AAAACA<7I7.G AGAATACTGA ACTTICTCGAA 


GTAAACXACC 


1920 


x ixi iGCATr CTCArrCAAC AAAAAAATCA AAGAAAGTGA AGTCAAAAAA 


GTCAATxGCCC 


1980 


TCATAAGACA TASCTATAAA GGGAAAGCTT ATG5.GATTCA GCTTAGCICT 


GGTAG7'AAAA 


2040 


TTAACAIMC TCCTGGCCAT ACTCTCiTrA CAGITAGAAA TGG?jGAAATA 


AAGGA^^GTTT 


2100 


CTGGAGATGG GAXAAAAGAA GCTGACCTTA TTGIAGCACC AAAGAAAATT 


AAACrC7^J\TG 


2160' 


AAAAAGGGGT AJSjGCAXAAAC ATTCXXXIAGT TAATCTCAGA TCTTTCCGAG 


GAAG?-^w\CAG 


2220 


CCGACAITGT GATGACGATT TCAGCX2\AGG GCAGAAAGAA CriCl'l'l'AAA 


GGATiTGCTGA 


2280 



FIG. 6- 2 
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GMCTnMG GTOGaiCTrr GGAGAAGAAA MAGAAGGAX AAGAACATIT AATCGCTATT 2340 

TGITOCaTCr CGAAAAACTA GGCCITATCA AACTRCTGOC CCGOGGM!3\T GAAGn?»CTC 2400 

ACTGGGAGSG TKEAAACMC ITEAOIAGAA GCnGCTGGA AGCGTIMGr 246Q 

ACMCGGAAA GRACamGM TMTr?JGT3\A TCTTCAACXIA GATCAAGGAT TTTATT^TCIT 2S2Q 

ACTTOCCACA AAASmSCTC GAAGAATGGA AAATTGGAAC TCTCAATGGC TTTAGA-ACm 2580 

ATrGTATTCT CAAAGTX3GAT CTjCGATTITG GGAAGCXOCT AGGITACTAT CnTAGTGAGG 2640 

GCTATGCAGG TGCACAAAAA AAT?AAACTG GTGGTATCRG TTATTOGGTG AAGCTITACA 2700 

AIGAGGACCC TAATCnCTT GAGAGCATGA AAAATGITGC AGAAAAATTC TirGGCAAGG 2760 

TTAGAGTTGA C?J3AAA1TGG CTAAGTATAT CAAAGAAGAT GGCATPCTTA GTTATGAAAT 2820 

GOCTCTGTGG AGCATTAGOC GAAAACAAGA GAATICCTTC TGTT7VTACTC ACCTCTOCCG 2880 

AACOGGIACG GTGGTCAnT TIMAGGCGT ATTITACAGG CGATGGAGAT ATACTiTCCAT 2940 

CAAAA?CGIT T?J3GCrCTCA ACAAAAAGCG AGCTCCITGC AAATCAGCIT GTGTrdTGC 3000 

TGAACTCTTT GGGAAIATCC TCTGtAAAGA T?dGGCnTGA Cr-^TrGGGGTC TATAGAGTGT 3060 

AIAIAAATGA AGAOCTCCAA TTTCCACAAA OGTCTAGGGA GAAAAACACA ITjCTACTCTA 3120 

ACTTAATTOC CAAAGAGATC CTIAGGGACG TGTITGGAAA ^dSAGTTCCAA AAGAJ^vOvTCA 3180 

CGTICAAGAA ATTIAAAGAG CTTGTTGACT CTGGAAAACT TTVACAGGGAG 7>AAjGCCAAGC 3240 

TCTTGGAGIT CTTCATTAAT GGAGATAITG TCCTTGACAG AGTCAAAAGT GTTAA?-.GAAA 3300 

AGG?£M7iTGA AGGGTATGTC TATGACCTAA GCGTTGAGGA TAAC33AGAAC TITCrTGTTG 33 60 

CjmTGGTTT GCTCTATGCr CACAACAGCT ATEACGGCIA TATGGGOTAT CCr7J'%GGCAA 3 420 

GATGGTACrC GAAGGAATCT GCTGAAAGCG TTACCGCATG GGGGAGACAC T7X:r%'rAGAGA 3480 

FIG.6-3 
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TC^OGMMG AGAAM^ GAAAAGITOG GCmMGoT GACTUTICTCr 
CAGG?^=AAM TC^^GATCAm MAAGGCAAA AOGGAAMAT TAGATlTCra AAAATOU^gg 

ATCrrrrcrc imggiggac msuxMTG gcgaaaaaga AmrrocArr crcGAAccrrc 

TOMGC^Cr AACICIt^GAC GATGACGGAA AGClTClCItS GAAGCCOTIX:: CCCITVOTKIA 
TCAGGCACAG AGOGAAOMA AGAAICTIOC CXMCTGGCT GACCAAC?^ •IX3GI7^'I7,I7^G 
ATCTTIT^CTGA GGATCAITCr CTCAraGGCT ATCTAAACAC GICAAAAACG AAAACIX3CCA 
AAAAAATOGG GGAAAGP^ AAGGAAGTAA AGCCITITCA AITAGGO.AA GO^THVAAAT 
OGCICAmTC OCCAAATCCA OOTITAAAGG ATCAGAAIK CAAAACTAGC GAAATTVCCT^ 
l^AAAAITCPG GG?^3Cia?n^ GGATTGATTG TMGAGAIXSG AAACIX3GGCT GGAGA-nXTIC 
GTTGGGCAGA GTATrATCrr GGAdTTCAA CAGGCAAAGA TGCAGAAGAG AITXAAGCTNAA 
AACirCTGGA AOOOCTAAAA ACITATOGAG IIAATCTCAAA CrATI7\CCCA AAAA^OS^^ 
AJ«3GGGACIT CAACATCITG GCAAAGAGCX: TrCTAAAGIT TMGAAAAGG CACirrr-.-GG 
AOGAAAAAGG AJGAOGAAAA ATTOCSGACTr TCXTCXATCA GCTTCCGGrT ACn70-.T7G 
AGGCATITCr AOGAGGACIG TTTrCAGCTG ATCSCTACICT AACIMOGG AAGGG?.GITC 
CAGAGATCAG GCIAACAAAC ATTGATGCTG 7CTrrCTA;>G GGr^;\CTA?GG A.'^GCnCICT 
GGATTCTrGG AATTTCAAAT TCAATATITC CTG^^S^TTPC TCCAAA2X3GC T?0\ATX^GTG 
TTTCrrPCTGG AACCEACTCA AAGCATCTAA GGATCAAAAA TAAGIGGCGT rrTGCTSAAA 
GGATAGGCrr TTIAATOGAG AGAAAGCAGT. AG^GACmT AGAACATTTA AAA-KT-GCGA 4560 
GGGTAAAAAG GAAXAOCATA GAmTGGCT TTGATCTIGr GCATGTGAAA AAAGICGT^AG 4620 
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MamcmnV CGAGGCTITAC GirTarcaCA TTGAaGICGA AGACaCGCAT agcjitctitc 46a0 

CAMCAACAT OCTGGIACAC AATACICTCG GCITITATCC CACAATACCC GGGGAAAAGC 4740 

CIGAACTCAT IMAAAGAAA GOCAAGGAAT TOCTAAACTR CATAAACTCC AAACITCCT^ 4800 

GICTSCITCA GCTTGAGXAT GAGGGCITIT ACITGAGAGG ATTCTriCTr ACAAAAAAGC 4860 

GCTATGCACT CAIAGATGAA GAGGGC?XXSV TAACAACAAG GGGCTTOGAA GTATOiiXSGA 49Z0 

GAGATTGGAG TGAGATAGCT AAGGAGPjCTC ^GGCAAAGGT TXTAGAGGCT AITUrTTTwiJ'^ 4980 

AGGGAAGTCT TCAAAAAGCT GTAGA/VHTG TTAG?gGATCT TCTAGAGAAA ATAGCAA?-?.T 5040 

ACAGGGTTCC ACTTGAAAAG dTGTTATCC ATGAGCT^GAT TACCAGGGAT TTAAAGGACT 5100 

7«ru\AGOCAT TGGCCCTCAT GTCGCGATAG CAAAAJiJGACT TGOCGCAAGA GGGATTuAAAG 5160 

TGAAMCGGG CACAATAAO^A AGCTATAT03 TTCTCAAAGG GAGCGGAAAG ATOSGCGATA 5220 

GGGXAAmr ACITACAGAA TACGAOXXTA GAAAACACAA CTACGATCCG GACTACTACA 5280 

TAGAAAACXIA AGTnTGCOS GCTVCTACITA GGAO^iyCTOGA AGCGTITGGA TTXCAGrJ^w^GG 5340 

AGGATTTAAG GXATCAAAGC TCAAAACAAA CX33GCTTAGA TXSCTlTGGCTC AAG7-GGTAGC 5400 

TCTGTTGCrr TITACTCCAA GTTTCrCCGC GAj;jrCrc:TC:r ATCICICTTT TCTArZCTGC 5460 

TArCTGGTTT TCATICACTA TTAAGrwSTC CGCCAAAGCC ATAACGCTTC CAATTCCAAA 5520 

CTTGAGCTCr TTCC?JC?rcrC TGGOCTCAAA TTCACTCCAT GimTGGT.T (XTCGCTTCT 5530 

CXXTCTTCTG CIT^AGCCTCT CGAATLTi'll' TCITGGCGAA GAGTGXAC^G CTATCIATGAT 5640 

TATCrCTTCC TCTGGAAACG GAlXJi'i'IAAA OGTCTGAATT TCATCTAGAG AOCTC;:,CrCC 5700 

GTCGATTAT^^ ACTGCCTTGr ACTTCTTOjG TAGTrCmT ACCTTTGGGA TCGTrJ-ATTT 5760 

TGCCAOGGCA TTGTOCCCAA GCTCCTGCCT AAGCTGAAOXJ CTCACACrGT TCATT^XTTC 5820 

GGGAGITCTT GGGATCC 5837 

FIG. 6-5 
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FIGURE L3 
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Pyrococcus sp, DNA Polymerase Gene Region 
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ANTI-T.LITORALIS POLTOEIIASE 
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ANTI-TAO POLYMERAS 
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GGATCCCTCTCTTTTTGGTAACCCCATACGTCATTCCCTCAACCAAAACT 
TCAGCATCGTTGCAGTGGTCAGTGTGTCTGTGGGAGATGAAGAGGACGTC 
GATTTTTCTGGGGTCTATCTTGTATCTCCACATTCTAACTAACGCTCCAG 
GCCCAGGATCAACGTAGATGTTTTTGCTCGCCTTAATGAAGAAGCCACCA 
GTGGCTCTTGCCTGCGTTATCGTGACGAACCTTCCACCACCGCCACCGAG 
AAAAGTTATCTCTATCATCTCACACCTCCCCCATAACATCACCTGCTCAA 
TTTTTAAGCGTTCTTAAAGGCTTAAATACGTGAATTTAGCGTAAATTATT 
GAGGGATTAAGTATGATACTTGACGCTGACTACATCACCGAGGATGGGAA 
GCCGATTATAAGGATTTTCAAGAAAGAAAACGGCGAGTTTAAGGTTGAGT 
ACGACAGAAACTTTAGACCTTACATTTACGCTCTCCTCAAAGATGACTCG 
CAGATTGATGAGGTTAGGAAGATAACCGCCGAGAGGCATGGGAAGATAGT 
GAGAATTATAGATGCCGAAAAGGTAAGGAAGAAGTTCCTGGGGAGGCCGA 
TTGAGGTATGGAGGCTGTACTTTGAACACCCTCAGGACGTTCCCGCAATA 
AGGGATAAGATAAGAGAGCATTCCGCAGTTATTGACATCTTTGAGTACGA 
CATTCCGTTCGCGAAGAGGTACCTAATAGACAAAGGCCTAATTCO-^.TGG 
AAGGCGATGAi.GAGCTCAAGTTGCTCGCATTTGACATAGAAACCCTCTAT 
CACGAii.GGGGAGGAGTTCGCGAAGGGGCCCATTATAATGATAAGCTATGC 
TGATGAGGAAGAAGCCAAAGTCATAACGTGGAAA;iJ\GATCGATCTCCCGT 
ACGTCGAGGTAGTTTCCAGCGAGAGGGAGATGATAAAGCGGTTCCTCAAG 
GTGATAAGGGAGAJi^-AGATCCCGATGTTATaji.TTACCTACAACGGCGATTC 
TTTCGACCTTCCCTATCTAGTTAAGAGGGCCGAAAAGCTCGGGATAAAGC 
TACCCCTGGGAii^.GGGACGGTAGTGAGCC.^JiAGATGCAGAGGCTTGGGGAT 
ATGACAGCGGTGGAGATAJIAGGGAAGGATACACTTTGACCTCTACCACGT 
GATTAGGAGAACGATAAACCTCCCAACATACACCCTCGAGGCAGTTTATG 
AGGCAATCTTCGGAAAGCCAAAGGAGAAJIlGTTTACGCTCACGAGATAGCT 
GAGGCCTGGGAGACTGGAAAGGGACTGGAGAGAGTTGC.AAJVGTATTCA-AT 
GGAGGATGC.AAAGGTAACGTACGAGCTCGGTAGGGAGTTCTTCCCAA.TGG 
AGGCCCAGCTTTCAAGGTTAGTCGGCCAGCCCCTGTGGGATGTTTCT.AGG 
TCTTC?-ACTGGCAACTTGGTGGAGTGGTACCTCCTCAGGAAGGCCTACGA 
GAGGPlATGAATTGGCTCCAAACAJ^GCCGGATGAGAGGGAGTACGAGAGAA 

ggctaagggagagctacgctgggggatacgtta-aggagccggagaa-i.ggg 
ctctgggaggggttagtttccctagatttcaggagcctgtacccctcgat 
aata-atcacccataacgtctcaccggatacgctga-acagggaagggtgta 
gggaa.tacgatgtcgccccagaggttgggcaca-a.gttctgc^-aggacttc 
ccggggtttatccccagcctgctcaj^.gaggttattggatga-a-aggcaji'.ga 
hat a---a_---.g g aj-.g at g aaagc tt ct aaj'.g ac c c aatc g ag a^g aj^g at g c 
ttgattacaggcaj^cgggcaatcp-aa-atcctggcaj\a.cagcattttaccg 
gaaga-atgggttccactaattaajila^cggtaji_agtta-agatattccgcat 
tggggacttcgttgatggacttatgaaggcgaaccaaggajuvagtgaaga 
aaacgggggatacagaagttttaga-agttgcaggaj\ttcatgcgttttcc 
tttgacagga-agtccaa.gaaggcccgtgta-atggcagtgaaagccgtgat 
aj\gacaccgttattccggaaatgtttataga.atagtcttaji-actctggta 
gaaa-aataa.caata-acagaagggcatagcctatttgtctatagga^cggg 
gatctcgttgaggcaactggggaggatgtca-a.aj\ttggggatcttcttgc 
agttccaagatcagtaaacctaccagagaji-aagggaacgcttgaatattg 
ttgaacttcttctgaatctctcaccggaagagacagaj^gatataatactt 

ACGATTCCAGTTAAAGGCAGAAAGAACTTCTTCAAGGGAATGTTGAGAJ^lC 
ATTACGTTGGATTTTTGGTGAGGAAAAGAGAGTAAGGACAGCGAGCCGCT 
ATCTAA.GACACCTTGAAAATCTCGGATACATAAGGTTGAGGAAAATTGGA 
TACGACATCATTGATAAGGAGGGGCTTGAGAAATATAGAACGTTGTACGA 
GAAACTTGTTGATGTTGTCCGCTATAATGGCA-ACAAGAGAGAGTATTTAG 
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TTGAATTTAATGCTGTCCGGGACGTTATCTCACTAATGCCAGAGGAAG.\A 
CTGAAGGAATGGCGTATTGGAACTAGAAATGGATTCAGAATGGGTACGTT 
CGTAGATATTGATGAAGATTTTGCCAAGCTTCTTGGCTACTATGTGAGCG 
AGGGAAGTGCGAGGAAGTGGAAGAATCAAACTGGAGGTTGGAGTTACACT 
GTGAGATTGTACAACGAGAACGATGAAGTTCTTGACGACATGGAACACTT 
AGCCAAGAAGTTTTTTGGGAAAGTCAAACGTGGAAAGAACTATGTTGAGA 
TACCAAAGAAAATGGCTTATATCATCTTTGAGAGCCTTTGTGGGACTTTG 
GCAGAAAACAAAAGGGTTCCTGAGGTAAXCTTTACCTCATCAAAGGGCGT 
TAGATGGGCCTTCCTTGAGGGTTATTTCATCGGCGATGGCGATGTTCACC 
CAAGCAAGAGGGTTCGCCTATCAACGAAGAGCGAGCTTTTAGTAAATGGC 
CTTGTTCTCCTACTTAACTCCCTTGGAGTATCTGCCATTAAGCTTGGATA 
CGATAGCGGAGTCTACAGGGTTTATGTAAACGAGGAACTTAAGTTTACGG 
AATACAGAAAGAAAAAGAATGTATATCACTCTCACATTGTTCCAAAGGAT 
ATTCTCAAAGAAACTTTTGGTAAGGTCTTCCAGAAAAATATAAGTTACAA 
GAAATTTAGAGAGCTTGTAGAAAATGGAAAACTTGACAGGGAGAAAGCCA 
AACGCATTGAGTGGTTACTTAACGGAGATATAGTCCTAGATAGAGTCGTA 
GAGATTAAGAGAGAGTACTATGATGGTTACGTTTACGATCTAAGTGTCGA 
TGAAGATGAGAATTTCCTTG 



FIGURE 18 (couc.) 



61 



EP 0 547 920 A2 



BESTFIT comparison of the deduced amino acid sequences of 
Pyrococcus DNA Polymerase (top line) to T.litoralis DMA Polymerase (bottom line) 

BESTFIT of: Pyrococcus . Pep from: 1 to: 10X9 

TRANSLATE DNA of: Pyrococcus . seq from: 363 to: 3420 
to: T.litoralis .Pep from: 1 co: 1100 

TRANSLATE DNA of: T . litoralis . seq from: 291 to: 5401 

Percent Similarity: 83.219 Percent Identity: 68.302 

1 MILDADYITEDGKPIIRIFKKENGEFECVEYDRNFRPYIYALLKDDSQIDE 50 
" " • " I I I . I I i I I I I I I I I I I I I I I : I . I . : I . I I , n , , , , , , . , . , 

1 MILDTDYITKDGKPIIRIFKKENGEFKXELDPHFQPyiyALLKDDSAIEE 50 

51 VRKITAERHGKr/RIIDAEKVRKKFLGRPIEVWRLyFEHPQDVPAIRDKI 100 

I • * M M I . I 1 : : I I | I I I • : I I I : I . MI I I I t I | : | : | | 

51 IKAIrCGERHGKT^/RVLDAVKVRKKFLGREVE\A>JKLirEHPQDVPAMRGKI 100 

101 REHSAVIDIFEYDIPFAKRYLIDKGLIPMEGDEELKLLAFDIETLYHEGE 150 

■"•')' I I " I t > I I I I I I I t I I I I I 1 I t I I I t I I I I I I I f I {:[ t I I : 
101 REHPAWDIYEYDIPFAKRYLIDKGLIPMEGDEELKLLAFDISTFYHEGD 150 

151 EFAXGPIIMISYADEEEAKVITWKKIDLPYVEWSSEREMIKRFLKVIRE 200 

I f : I I . Mi I 1 I i I I ! I I : M I t I . n I I 1 i : 111 . I I I I I I I I : . I : : I 
151 EFGKGEIIMISYADEEEARVITWKNIDLPYVDWSNEREMIKRFVQWKE 2 00 



201 KDPDVTITYNGDSFDLPYLVKRAEKLGIKLPLGRDG. .SEPKMQRLGDMT 248 

ltlMMMMMMMM:MIMM::I.IMt .IM:M:M 
201 KDPDVIITYNGDNFDLPYLIKRAEKLGVRLVLGRDKEHPEPKIQRWGDSF 250 

24 9 AVEIKGRIHFDLYHVIRRTINLPTYTLEAVYEAIFGKPKEKVYAHEIAEA 298 

" I " I M M M : . M M M M M M M M M M : M . I . M M M I . 
251 AVEIKGRIHFDLFPWRRTINLPTYTLEAVYEAVLGKTKSKLGAEEIAAI 300 

2 99 WETGKGLERVAKYSMEDAKVTYELGREFFPMEAQLSRLVGQPLWDVSRSS 3 48 

" I : • - • - f • 1 t M M : . M M M M M M M M : M M . : M M M I 
301 WETEESMKKLAQYSMEDARATYELGKEFFPMEAELAKLIGQSVWDVSRSS 350 

349 TGNLVEWYLLRKAYERNELAPNKPDEREYERRLRESYAGGYVKEPEKGLW 398 

"IIIMMM M.MMMMMI IMMM..I MMMMMM 
351 TGNLVEWYLLRVAYARNELAPNKPDEEEYKRRLRTTYLGGYVKEPEKGLW 400 

399 EGLVSLDFRSLYPSIIITHNVSPDTLNREGCREYDVAPEVGHKFCKDFPG 448 

' • = = I t M M M M I : M M M M 1 : : M M- : M M I M . : M M I M 
401 ENIIYLDFRSLYPSIIVTHNVSPDTLEKEGC.KNYDVAPIVGYRFCXDFPG 450 

449 FIPSLLKRLLDERQEIKRKMKASKDPIEKKMLDYRQRAIKILANSILPEE 498 

IMMMM.. MMMMMMMM:MMMMI 
451 FIPSXLGDLIAMRQDIKKKMKSTIDPIEKKMLDYRQRAIKLLANSILPNE 500' 

499 WVPLIKNGKVKIFRIGDFVDGLMKANQGKVKKTGDTEVLEVAGIHAFSFD 548 

l:MI.M.:M.:M:M::.|. . . : . M . . : : M M M . . : MM: 
501 WLPIIENGEIKFVKIGEFINSYMEKQKENVKTVENTEVLEVNNLFAFSFN 550 
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549 RKSKKARVMAVKAVIRHRYSGNVYRIVLNSGRKITITEGHSLFVYRNGDL 5 98 

: I I . . I . I I I : I I I : I . I . . I I I . I I I i I . I I . I I I I I . ) M : : 
551 KKIKESEVKKVKALIRHKYKGKAYEIQLSSGRKINITAGHSLFTVRNGEI 600 

599 VEATGEDVKIGDLLAVPRSVNLPEKRERLNIVELLLNLSPEETEDIILTI 648 

I . . I : : : I I M : . . I : . ; . I M . : I I . 1 I : : 1 I . I I I . I I : : I | 
601 KEVSGDGIKEGDLIVAPKKIKLNEKGVSINIPELISDLSEEETADIVMTI 650 

649 PVKGRKNFFKGMLRTLRWIFGEE.KRVRTASRYLRHLENLGVIRLRKIGY 697 

• • M M I I M I I I i I I I I : I I I I : [ : I I .III I M . I I . I : t . It 
551 SAKGRKNFFKGMLRTLRWMFGEENRRIRTFNRYLFHLEKLGLIKLLPRGY 700 

6 98 DIIDKEGLEKYRTLYEKLVDWRYNGNKREYLVEFNAVRDVISLMPEEEL 7 47 

: ' . I . I I . M : Mill.: I : I M 1 I M M I M . : : I . I I . : I : . II 
701 EVTDWERLKKYKQLYEKLAGSVKYNGNKREYLVMFNEIKDFISYFPQKEL 750 

748 KEWRIGTRNGFRMGTFVDIDEDFAKLLGYYVSEGSARKWKNQTGGWSYTV 797 

.11:111 MM M II : I II M M I II I I Ml M I I . I 

751 EEWKIGTLNGFRTNCILKVDEDFGKLLGYYVSEGYAGAQKNKTGGISYSV 8 00 

798 RLYNENDEVLDDMEHLAKKFFGKVKRGKNYVEIPKKMAYIIFESLCGTLA 847 

: II M : . : I I : . I . : : I . I II I 1! : : : I : I . I . I II II : : : . : M I . M 
801 KLYNEDPNVLESMKNVAEKFFGKVRVDRNCVSISKKMAYLVMKCLCGALA 850 

8 48 ENKRVPEVIFTSSKGVRWAFLEGYFIGDGDVHPSKRVRLSTKSELLVNGL 8 97 

I M I : M II : M . . . I 11 . I M : M . II I I : I II I M 1 I I M t I I I . I . I 
851 ENKRIPSVILTSPEPVRWSFLEAYFTGDGDIHPSKRFRLSTKSELLANQL 900 

898 VLLLNSLGVSAIKLGYDSGVYRVYVNEELKFTEYRKKKNVYHSHIVPKDI 947 

M I I II II : i . : I : I : 11 I II II I : I I : I . I . : . : . I 1 . 1 . I : : : II : I 
901 VFLLNSLGISSVKIGFDSGVYRVYINEDLQFPQTSREKNTYYSNLIPKEI 950 

948 LKETFGKVFQKNISYKKFRELVENGKLDREKAKRIEWLLNGDIVLDRWE 997 

I : : • I M I I I I : . : M I : 11 I : . I I I : I I I I I : I : : : M M II I I I . 
951 LRDVFGKEFQKNMTFKKFKELVDSGKLNREKAKLLEFFINGDIVLDRVKS 1000 

998 IKREYYDGYVYDLSVDEDENFL 1019 
: I . 1 : 1 1 I II I 1 I ::: I I I I 
1001 VKEKDYEGYVYDLSVEDNENFL 1022 
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